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NOVEL EXPRESSION VECTORS AND USES THEREOF 

1. FIELD OF THE INVENTION 
The present invention relates to novel vectors, to DNA vaccines and gene 
5 therapeutics containing said vectors, to methods forthe preparation of the vectors 
and DNA vaccines and gene therapeutics containing the vectors, and to thera- 
peutic uses of said vectors. More specifically, the present invention relates to no- 
vel vectors comprising (a) an expression cassette of a gene of a nuclear- 
anchoring protein, which contains (i) a DNA binding domain capable of binding to 

10 a specific DNA sequence and (ii) a functional domain capable of binding to a nu- 
clear component and (b) a multimerized DNA forming a binding site for the an- 
choring protein of a nuclear-anchoring protein, and optionally (c) one or more ex- 
pression cassettes of a DNA sequence of interest. In particular the invention re- 
lates to vectors that lack a papilloma virus origin of replication. The invention also 

15 relates to vectors that lack an origin of replication functional in a mammalian cell. 
The invention further relates to methods for expressing a DNA sequence of in- 
terest in a subject. 

2. BACKGROUND OF THE INVENTION 

20 Transfer of autologous or heterologous genes into animal or human organ- 

isms with suitable vectors is emerging as a technique with immense potential to 
cure diseases with a genetic background or to prevent or cure infectious dis- 
eases. Several types of viral and non-viral vectors have been developed and 
tested in animals and in human subjects to deliver a gene/genes that are defec- 

25 tive by mutations and therefore non-functional. Examples of such vectors include 
Adenovirus vectors, Herpes virus vectors, Retrovirus vectors, Lentivirus vectors 
and Adeno-associated vectors. 

Vaccination has proven to be a highly effective and economical method to 
prevent a disease caused by infectious agents. Since the introduction of the Vac- 

30 cinia virus as an attenuated vaccine against the smallpox virus (Variola), vaccines 
against a multitude of human pathogens have been developed and taken into 
routine use. Today small pox has been eradicated by vaccinations and the same 
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is to be expected shortly for the poliovirus. Several childhood diseases, such as 
pertussis, diphtheria and tetanus, can be effectively prevented by vaccinations. 

In general, the most successful viral vaccines are live avirulent mutants of 
the disease-causing viruses. The key to the success of this approach is the fact 

5 that a living virus targets the same organs, the same type and similar number of 
cells, and therefore, by multiplying in the recipient, elicits a long-lasting immune 
response without causing the disease or causing only a mild disease. In effect, a 
live attenuated vaccine produces a subclinical infection, the nature's own way of 
immunizing. As a result, a full immune response will be induced, including hu- 

10 moral, cellular and innate responses, providing a long lasting and sometimes a 
life-long immune protection against the pathogen. 

Although live attenuated vaccines are most potent, they can cause harmful 
side effects. Thus, an attenuated viral vaccine can revert to a virulent strain or in 
cases where the attenuated virus is apathogenic in adults it can still cause a dis- 

15 ease in infants or in disabled persons. This is true in the case of viruses causing 
chronic infections, such as Human Immunodeficiency Virus type 1 and 2. Vac- 
cines composed of viral and bacterial proteins or immunogenic peptides are less 
likely to cause unwanted side effects but may not be as potent as the live vac- 
cines. This is especially the case with vaccines against microbes causing chronic 

20 infections, such as certain viruses and intracellular bacteria. 

The strength and type of immune response is, however, also dependent 
on how the viral proteins are processed and how they are presented to the im- 
mune system by antigen presenting cells (APCs), such as macrophages and 
dendritic cells. Protein and peptide antigens are taken up by APCs via endocyto- 

25 sis, processed to small immunogenic peptides through an endosomal pathway 
and presented to T-lymphocytes (T-cells) by MHC (major histocompatibility com- 
plex) class II antigens [in man HLAs (human leukocyte antigens) class II]. In con- 
trast, proteins synthesized de novo in APCs or in possible target cells for an im- 
mune response, will be processed through a cytoplasmic pathway and presented 

30 to T-cells by MHC class I antigens (in man HLAs class I). In general, the presen- 
tation of immunogenic peptides through the class II pathway will lead to the acti- 
vation of the helper/inducer T-cells, which in turn will lead to the activation of B- 
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cells and to antibody response. In contrast, presentation through class I MHC fa- 
vors the induction of cytotoxic T-Iymphocytes (CTLs), which are capable of rec- 
ognition and destruction of virally infected cells. 

In early 1990's, a method to mimic the antigen processing and presenta- 
5 tion that was normally achieved by live attenuated vaccines was introduced [Ul- 
mer, J.B. et al Science 259 (1993) 1745-1749]. It was shown that an injection of 
eukaryotic expression vectors in the form of circular DNA into the muscle induced 
take-up of this DNA by the muscle cells (and probably others) and was able to 
induce the expression of the gene of interest, and to raise an immune response, 
10 especially a cellular immune response in the form of CTLs, to the protein enco- 
ded by the inserted gene. Since that observation, DNA immunization has become 
a standard method to induce immune responses to foreign proteins in 
experimental animals and human studies with several DNA vaccines are under- 
way. 

15 Generally, the DNA vectors used in these vaccine studies contain a clon- 

ing site for the gene of interest, a strong viral promoter, such as the immediate 
early promoter of the CMV virus, in order to drive the expression of the gene of 
interest, a polyadenylation region, and an antibiotic resistance gene and a bacte- 
rial replication origin for the propagation of the DNA vector (plasmid) in bacterial 

20 cells. 

With the vectors described above it is possible to obtain a detectable level 
of expression of the gene of interest after administering the vector to experimen- 
tal animals or to humans, either by a direct injection to muscle or to skin with a 
particle bombardment technique or by applying the vector in a solution directly to 

25 mucous membranes. However, the expression obtained by these vectors is short 
lived: the vectors tend to disappear from the transfected cells little by little and are 
not transferred to daughter cells in a dividing cell population. The short-term ex- 
pression of the gene of interest and limited number of cells targeted are probably 
the major reasons, why only temporary immune responses are observed in sub- 

30 jects immunized with DNA vectors described above. Thus, for example, Boyer et 
al. observed only temporary immune responses to HIV-1 Env and Rev proteins in 
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human subjects, who were immunized several times with a vector similar to the 
those described above [Boyer, J. D., J Infect Dis 181 (2000) 476 - 483]. 

There is a growing interest in developing novel products useful in gene 
therapy and DNA vaccination. For instance papilloma virus vectors carrying the 
5 expression cassette for the gene of interest have been suggested to be useful 
candidates. 

To date more than 70 subtypes of human papilloma viruses (HPVs) and 
many different animal papilloma viruses have been identified [zur Hausen, H. and 
de Villiers E.. Annu Rev Microbiol 48 (1994) 427 - 447; Bernard, H., et al., Curr 
10 Top Microbiol Immunol 1 86 (1 994) 33 - 54]. All papilloma viruses share a similar 
genome organization and the positioning of all of the translational open reading 
frames (ORFs) is highly conserved. 

Papilloma viruses infect squamous epithelial cells of skin or mucosa at dif- 
ferent body sites and induce the formation of benign tumors, which in some cases 
15 can progress to malignancy. The papilloma virus genomes are replicated and 
maintained in the infected cells as multicopy nuclear plasmids. The replication, 
episomal maintenance, expression of the late genes and virus assembly are tight- 
ly coupled to the differentiation of the epithelial tissue: the papilloma virus DNA 
episomal replication takes place during the initial amplificational replication and 
20 the second, i.e. latent, and the third, i.e. vegetative, replications in the differ- 
entiating epithelium [Howley, P. M.; Papillomavirinae: the viruses and their repli- 
cation. In Virology, Fields, B. C, Knipe, D. M. f Howley, P. M., Eds., Lippincott- 
Raven Publishers, Philadelphia, USA, 1996, 2. Edition, p. 2045 - 2076]. 

Two viral factors encoded by the E1 and E2 open reading frames have 
25 been shown to be necessary and sufficient for the initiation of the DNA replication 
from the papilloma virus origin in the cells [Ustav, M. and Stenlund, A., EMBO J 
10 (1991)449 - 57; Ustav, M., etal., EMBO J 10 (1991)4321 -4329; Ustav, E., 
et al., Proc Natl Acad Sci USA 90 (1993) 898 - 902]. 

Functional origins for the initiation of the DNA replication have been de- 
30 fined for BPV1 [Ustav, M., et al., EMBO J 10 (1991) 4321 - 4329], HPV1a 
[Gopalakrishnan, V. and Khan, S., supra], HPV11 [Russell, J., Botchan, M., J Vi- 
rol 69 (1995) 651 -660], HPV18 [Sverdrup, F. and Khan, S., J Virol 69 (1995) 
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1319 - 1323:Sverdrup, F. and Khan, S., J Virol 68 (1994) 505-509] and many 
others. Characteristically, all these origin fragments have a high A/T content, and 
. they contain several overlapping individual E1 protein recognition sequences, 
which together constitute the E1 binding site [Ustav, M., et al., EMBO J 1 0 (1 991 ) 
5 4321 -4329; Holt, S., et al., J Virol 68 (1994) 1094-1 102; Holt, S. and Wilson, 
V., J Virol 69 (1995) 6525 - 3652; Sedman, T., et al. J Virol 71 (1997) 2887 - 
2996]. In addition, these functional origin fragments contain an E2 binding site, 
which is essential for the initiation of DNA replication in vivo in most cases (Ustav, 
E., et al., supra). The E2 protein facilitates the first step of the origin recognition 
10 by E1 . After the initial binding of monomeric E1 to the origin the multimerization of 
E1 is initiated. This leads to the formation of the complex with the ori melting ac- 
tivity. It has been suggested that E2 has no influence on the following stages of 
the initiation of the DNA replication [Lusky, M., et al., Proc Natl Acad Sci USA 91 
(1994)8895-8899]. 

15 The BPV1 E2 ORF encodes three proteins that originate from selective 

promoter usage and alternative mRNA splicing [Lambert, P., et al., Anna Rev 
Genet 22 (1 988) 235 - 258]. All these proteins can form homo- and heterodimers 
with each other and bind specifically to a 12 bp interrupted palindromic sequence 
5-ACCNNNNNNGGT-3' [Androphy, E., et al., Nature 325 (1987) 70 - 739], 

20 There are 1 7 E2 binding sites in the BPV1 genome and up to four sites in 

the HPV genomes, which play a crucial role in the initiation of viral DNA replica- 
tion (Ustav, E M et al., supra) and in the regulation of viral gene expression (How- 
ley, P. M., Papillomavirinae: the viruses and their replication, in Virology, Fields, 
B. C, Knipe, D. M., Howley, P. M., Eds., Philadelphia: Lippincott-Raven Pubtish- 

25 ers, 1996. 2. edition, p. 2045 - 2076). Structural and mutational analyses have 
revealed three distinct functional domains in the full size E2 protein. The N- 
terminal part (residues 1 to 210) is an activation domain for transcription and rep- 
lication. It is followed by the unstructured hinge region (residues 21 1 to 324) and 
the carboxy-terminal DNA binding-dimerization domain (residues 325 to 410) 

30 [Dostatni, N., et al., EMBO J 7 (1988) 3807 - 3816; Haugen, T., et al. EMBO J 7 
(1 988) 4245 - 4253; McBride, A., et al., EMBO J 7 (1988) 533 - 539; McBride, 
A., et al., Proc Natl Acad Sci USA 86 (1989) 510-514]. On the basis of X-ray crys- 
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tallographical data, the DNA binding-dimerization domain of E2 has a structure of 
a dyad-symmetric eight-stranded antiparallel beta barrel, made up of two identical 
"half-barrel" subunits [Hegde, R. f et al., Nature 359 (1992) 505-512; Hegde, R., 
J Nuc! Med 36(6 Suppl) (1995) 25S - 27S]. The functional elements of the trans- 

5 activation domain of E2 have a very high structural integrity as confirmed by mu- 
tational analysis [Abroi, A., et al., J Virol 70 (1996) 6169 - 6179; Brokaw, J., et 
al., J Virol 71 (1996) 23 - 29; Grossel, M., et al., J Virol 70 (1996) 7264 - 7269; 
Ferguson, M. and Botchan, M., J Virol 70 (1996)4193-4199] and by X-ray crystal- 
lography [Harris, S., and Botchan, M.R., Science 284 (1 999) 1673-1677 and Ant- 

10 son, A. et al., Nature 403 (2000) 805-809]. In addition, X-ray crystallography 
shows that the N-terminal domain of the E2 protein forms a dimeric structure, 
where Arg 37 has an important function in dimer formation (Antson, A., et a!., su- 
pra). 

As has been described previously, bovine papillomavirus type 1 E2 protein 

15 in trans and its multiple binding sites in cis are both necessary and sufficient for 
the chromatin attachment of the episomal genetic elements. The phenomenon is 
suggested to provide a mechanism for partitioning viral genome during viral infec- 
tion in the dividing cells [lives, I., et al., J Virol. 73 (1999) 4404-4412]. 

None of the papilloma vectors or other vectors disclosed so far fulfills the 

20 criteria and requirements set forth for an optimal vaccine, which are the same for 
DNA vaccines and for conventional vaccines. (It should be noted that these re- 
quirements are preferred but not necessary for use as a vaccine.) First, an opti- 
mal vaccine must produce protective immunity with minimal adverse effects. Thus 
the vaccine should be devoid of components, which are toxic and/or cause symp- 

25 toms of the disease to the recipient. Second, an optimal vaccine must induce a 
pathogen-specific immune response, i.e. it must elicit a strong and measurable 
immune response to the desired pathogen without causing an immune response 
to other components of the vaccine. These two requirements imply that a vector 
to be used as a DNA vaccine should optimally only express the desired gene(s) 

30 and optimally should not replicate in the host or contain any sequences homolo- 
gous with those of the recipient, since nucleotide sequences that are homologous 
between the vector and the host's genome may effect the integration of the vector 
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into the host's genome. Third, an optimal vaccine must induce a right type of 
immune response; i.e. it must raise both humoral and cellular immune responses 
in order to act on the intracellular and extracellular pathogen. Finally, an optimal 
vaccine must be stable, i.e. it must retain its potency for a sufficiently long time in 
5 the body to raise the immune response in a vaccine formulation for use in various 
demanding circumstances during storage and preparation. Additionally, vaccines 
should be of reasonable price. Further, the route and the method of inoculation 
are important considerations for optimizing a DNA immunization. 

When developing a DNA vaccine the stability of the expression of the de- 

10 sired gene is sometimes a major problem. Thus, the maintenance function or the 
persistance of the vector in the recipient cell has been focused on in the prior art, 
however, often at the cost of the safety. For example, Ohe, Y., et al. ][Hum Gene 
Ther 6(3) (1995) 325-333] disclose a papilloma virus vector capable of stable, 
high-level gene expression, which is suggested for use in gene therapy. Trans- 

15 forming early genes E5, E6, and E7 have been deleted from said vector, but it still 
contains nucleotide sequences encoding other papilloma viral genes, such as the 
E1 and E2 genes, which are involved in the replication of the virus. Thus, the vec- 
tor produces several other papilloma proteins, which may elicit undesired immune 
responses and which induce a risk of the vector's integration in the recipient. Al- 

20 so, the vector is replicable, since it contains the E1 gene. Additionally, it is large in 
size and therefore subject to bacterial modification during preparation. 

International Patent Application PCT/EE96/00004 (WO 97/24451) dis- 
closes vectors capable of a long-term maintenance in a host cell and methods.us- 
ing such vectors for obtaining long-term production of a gene product of interest 

25 in a mammalian host cell, which expresses E1 and E2. These vectors contain a 
minimal origin of replication of a papilloma virus (MO), a Minichromosome Main- 
tenance Element (MME) of a papilloma virus and a gene encoding said gene 
product, the MO and MME consisting of a DNA sequence different from the natu- 
• ral papilloma virus sequence, and in some embodiments the E1 gene: Addition- 

30 ally, vectors containing an MME consisting essentially often E2 binding sites are 
disclosed in some examples. These vectors require the presence of the E1 pro- 
tein either in the host or in the vector for the expression. This imparts the replica- 
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tion function to the vectors. These vectors also express the E1 protein in addition 
to the gene of interest and the E2 protein and contain sequences, such as rabbit 
(3-globin sequences, which are partially homologous to human sequences caus- 
ing a serious risk of integration to human genome, which reduces the potential of 
5 these vectors as DNA vaccines. Additionally, the vectors are unstable due to their 
size (ca 15 kb): at the preparation stage in a bacterial cell, the bacterial replication 
machinery tends to modify the vector by random slicing of the vector, which leads 
to unsatisfactory expression products including products totally lacking the gene 
of interest. 

10 International Patent Application PCT/EE96/00004 (WO 97/24451 ) further 

discloses that E1 and E2 are the only viral proteins necessary for the episomal 
iong-term replication of the vectors. Additionally, the maintenance function of the 
BPV1 genome is associated with the presence of minimal ori (MO), which is 
stated to be necessary, although not sufficient, for the long-term persistence or 

15 the stable maintenance of the vectors the cells. In addition, the cis-elements, i.e. 
the Minichromosome Maintenance Elements of the BPV1, are stated to be re- 
quired for the stable replication of BPV1 . In particular, multimeric E2 binding sites 
(E2BS) are stated to be necessary for the stable maintenance of the vectors. 
. There is a clear need for improved novel vectors, which would be useful as 

20 DNA vaccines. 

An object of the invention is therefore to provide novel vectors, which are 
capable of a long-term maintenance in a large and increasing number of different 
cells of the host's body and thereby capable of providing a stable expression of 
the desired antigen(s). 

25 Another object of the invention is to provide novel vectors, which are main- 

tained for a long period of time in the cells that originally received the vector and 
transferred it to the daughter cells after mitotic cell division. 

Yet another object of the invention is to provide novel vectors, which ex- 
press in addition to the gene or genes of interest preferably only a gene neces- 

30 sary for a long-term maintenance in the recipient cells and thus are devoid of 
components that are toxic or cause symptoms of the disease to the recipient. 
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A further object of the invention is to provide novel vectors, which mimic at- 
tenuated live viral vaccines, especially in their function of multiplying in the body, 
without inducing any considerable signs of disease and without expressing unde- 
sired proteins, which may induce adverse reactions in a host injected with the 
5 DNA vaccine. 

Still a further object of the invention is to provide novel vectors, which do 
not replicate in the recipient. 

Still another object of the invention is to provide novel vectors, which in- 
duce both humoral and cellular immune responses when used as DNA vaccines. 
10 Yet another object of the invention is to provide novel vectors, which are 

suitable for a large-scale production in bacterial cell. 

Yet another object of the invention is to provide novel vectors, which are 
not host specific and thus enable the production in various bacterial cells. 

An additional object of the invention is to provide novel vectors, which are 
15 useful as carrier vectors for a gene or genes of interest, 

A further object of the invention is to provide novel vectors, which are:use- 
ful in gene therapy and as gene therapeutic agents and for the production of 
macromolecular drugs in vivo. 

20 3. SUMMARY OF THE INVENTION 

The present invention discloses novel vectors, which meet the require- 
ments of a carrier vector of a gene or genes of interest or of an optimal DNA vac- 
cination vector and which are preferably devoid of drawbacks and side effects of 
prior art vectors. 

25 The present invention is based on the surprising finding that a vector 

(plasmid) carrying (i) an expression cassette of a DNA sequence encoding a nu- 
clear-anchoring protein, and (ii) multiple copies of high affinity binding sites for 
said nuclear-anchoring protein spreads in proliferating cells. As a result, the 
number of vector-carrying cells increases even without the replication of the vec- 

30 tor. When the vector additionally carries a gene or genes of interest, the number 
of such cells that express a gene or genes of interest similarly increases without 
the replication of the vector. Thus, the vector of the invention lacks a papilloma 
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virus origin of replication. In a preferred embodiment, the vector of the invention 
lacks an origin of replication that functions in a mammalian cell. 

Accordingly, the present invention discloses novel vectors useful as carrier 
vectors of a gene or genes of interest, in DNA vaccination and gene therapy and 
5 as gene therapeutic agents. In a specific embodiment, said vectors are capable 
of spreading and, if desired, of expressing a gene or genes of interest in an in- 
creasing number of cells for an extended time. The vectors of the present inven- 
tion preferably express only a nuclear-anchoring protein, and, if desired, the gene 
or genes of interest, and optionally a selectable marker. However, they prefera- 
10 bly lack any redundant, oncogenically transforming or potentially toxic sequences, 
thereby avoiding a severe drawback of the vectors previously disclosed or sug- 
gested for use as DNA vaccines, i.e. hypersensitivity reactions against other viral 
components. In certain embodiments of the invention, this is achieved by low le- 
vel of the expressed nuclear-anchoring protein in the cells. At the same time, the 
15 vectors of the present invention induce both humoral and cellular immune re- 
sponses, where the gene or genes of interest is included in the vector. 

The vectors of the present invention are advantageous for use both in vitro 
(e.g., in the production level) and in vivo (e.g., vaccination). 

The present invention relates to the subject matter of the invention as set 
20 forth in the attached claims. 

The present invention relates to expression vectors comprising: (a) a DNA 
sequence encoding a nuclear-anchoring protein operatively linked to a heterolo- 
gous promoter, said nuclear-anchoring protein comprising (i) a DNA binding do- 
main which binds to a specific DNA sequence, and (H) a functional domain that 
25 binds to a nuclear component, or a functional equivalent thereof; and (b) a mul- 
timerized DNA sequence forming a binding site for the nuclear anchoring protein, 
wherein said vector lacks a papilloma virus origin of replication. In a preferred 
embodiment a vector of the invention lacks an origin of replication functional in a 
mammalian cell. 

30 In certain embodiments, the nuclear component is mitotic chromatin, the 

nuclear matrix, nuclear domain 10 (ND10), or nuclear domain POD. 
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In certain specific embodiments, the nuclear anchoring-protein is a chro- 
matin-anchoring protein, and said functional domain binds mitotic chromatin. 

In certain embodiments, the nuclear-anchoring protein contains a hinge or 
linker region. 

5 In certain embodiments, the nuclearranchoring protein is a natural protein 

of eukaryotic, prokaryotic, or viral origin. In certain specific embodiments, the 
natural protein is of viral origin. 

In certain embodiments, the nuclear-anchoring protein is a natural protein 
. of eukaryotic origin. 

10 In certain embodiments, the nuclear-anchoring protein is that of a papil- 

loma virus or an Epstein-Barr virus. 

In specific embodiments, the nuclear-anchoring protein is the E2 protein of 
Bovine Papilloma Virus type 1 or Epstein-Barr Virus Nuclear Antigen 1 . 

In a specific embodiment, the nuclear-anchoring protein is the E2 protein 
15 of Bovine Papilloma Virus type 1 . 

In specific embodiments, the nuclear-anchoring protein is a High Mobility 
Group protein. 

In certain embodiments, the nuclear-anchoring protein is a non-natural 
protein. 

20 In certain embodiments, the nuclear-anchoring protein is a recombinant 

protein, a fusion protein, or a protein obtained by molecular modeling techniques. 

In specific embodiments, the recombinant protein, fusion protein, or pro- 
tein obtained by molecular modeling techniques contains any combination of a 
DNA binding domain which binds to said specific DNA sequence and a functional 

25 domain which binds to a nuclear component, wherein said functional domain 
which binds to a nuclear component is that of a papilloma virus, an Epstein-Barr- 
Virus, or a High Mobility Group protein. 

In certain specific embodiments, the recombinant protein, fusion protein, 
or protein obtained by molecular modeling techniques contains any combination 

30 of a DNA binding domain which binds to said specific DNA sequence and a func- 
tional domain which binds to a nuclear component, wherein said functional do- 
main which binds to a nuclear component is that of E2 protein of Bovine Papil- 
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loma Virus type 1, Epstein-Barr Virus Nuclear Antigen 1, or a High Mobility Group 
protein. 

In certain embodiments, the vector further comprises one or more expres- 
sion cassettes of a DNA sequence of interest. 
5 In certain embodiments, the DNA sequence of interest is that of an infec- 

tious pathogen. In certain embodiments, the infectious pathogen is a virus. In 
certain specific embodiments, the virus is selected from the group consisting of 
Human Immunodeficiency Virus (HIV), Herpex Simplex Virus (HSV), Hepatitis C 
Virus, Influenzae Virus, and Enterovirus. 
10 In certain embodiments, the DNA sequence of interest is that of a bacte- 

rium. In certain embodiments, the bacterium is selected from the group consist- 
ing of Chlamydia trachomatis, Mycobacterium tuberculosis, and Mycoplasma 
pneumonia. In a specific embodiment, the bacterium is Salmonella. 

In certain embodiments, the DNA sequence of interest is that of a fungal 
15 pathogen. In certain embodiments, the fungal pathogen is Candida albigans. 

In certain embodiments, the DNA sequence of interest is of HIV origin. 
In specific embodiments, the DNA sequence of interest encodes a non- 
structural regulatory protein of HIV. In more specific embodiments, the non- 
structural regulatory protein of HIV is Nef, Tat and/or Rev. In a specific embodi- 
20 ment, the non-structural regulatory protein of HIV is Nef. 

In certain embodiments, the DNA sequence of interest encodes a struc- 
tural protein of HIV. In a specific embodiment, the DNA sequence of interest is 
the gene encoding HIV gp1 20/gp1 60. 

In certain embodiments, the vector of the invention comprises a first ex- 
25 pression cassette comprising a DNA sequence of interest which encodes Nef, Tat 
and/or Rev, and a second expression cassette comprising a DNA sequence of in- 
terest which encodes Nef, Tat and/or Rev. 

In certain embodiments, the vector of the invention comprises a first ex- 
pression cassette comprising a DNA sequence of interest which encodes Nef, Tat 
30 and/or Rev, and a second expression cassette comprising a DNA sequence of in- 
terest which encodes a structural protein of HIV. 
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In certain embodiments, the DNA sequence of interest encodes a protein 
associated with cancer. 

In certain embodiments, the DNA sequence of interest encodes a protein 
associated with immune maturation, regulation of immune responses, or regula- 
5 tion of autoimmune responses. In a specific embodiment, the protein is APE- 
CED. 

In a specific embodiment, the DNA sequence of interest is the Aire gene. 

In certain embodiments, the DNA sequence of interest encodes a protein 
that is defective in any hereditary single gene disease. 
10 In certain embodiments, the DNA sequence of interest encodes a macro- 

molecular drug. 

In certain embodiments, the DNA sequence of interest encodes a cyto- 
kine. In certain specific embodiments, the cytokine is an interleukin selected from 
the group consisting of IL1 , IL2, IL4, IL6 and IL1 2. In certain other specific em- 
15 bodiments, the DNA sequence of interest encodes an interferon. 

In certain embodiments, the DNA sequence of interest encodes a biologi- 
cally active RNA molecule. In certain specific embodiments, the biologically ac- 
tive RNA molecule is selected from the group consisting of inhibitory antisense 
and ribozyme molecules. In certain specific embodiments, the inhibitory an- 
20 tisense or ribozyme molecules antagonize the function of an oncogene. 

A vector of the invention is suitable for the use for the production of a 
therapeutic macromolecular agent in vivo. 

In certain embodiments, the invention provides a vector for use as a me- 
dicament. 

25 In certain embodiments, the invention provides a vector for use as a car- 

rier vector for a gene, genes, or a DNA.sequence or DNA sequences of interest, 
such as a gene, genes, or a DNA sequence or DNA sequences encoding a pro- 
tein or peptide of an infectious agent, a therapeutic agent, a macromolecular 
drug, or any combination thereof. 

30 In certain specific embodiments, the invention provides a vector for use as 

a medicament for treating inherited or acquired genetic defects. 
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In certain embodiments, the invention provides a vector for use as a 
therapeutic DNA vaccine against an infectious agent. 

in certain embodiments, the invention provides a vector for use as a 
therapeutic agent. 

5 The invention further relates to methods for providing a protein to a sub- 

ject, said method comprising administering to the subject a vector of the inven- 
tion, wherein said vector (i) further comprises a second DNA sequence encoding 
the protein to be provided to the subject, which second DNA sequence is opera- 
bly linked to a second promoter, and (ii) does not encode Bovine Papilloma Virus 

10 protein E1 , and wherein said subject does not express Bovine Papilloma Virus 
protein E1 . 

The invention further relates to methods for inducing an immune response 
to a protein in a subject, said method comprising administering to the subject a 
vector of the invention wherein said vector (i) further comprises a second DNA 
15 sequence encoding said protein, which second DNA sequence is operabiy linked 
to a second promoter, and (ii) does not encode Bovine Papilloma Virus protein 
E1 , and wherein said subject does not express Bovine Papilloma Virus protein 
E1. 

The invention further relates to methods for treating an infectious disease 
20 in a subject in need of said treatment, said method comprising administering to 
said subject a therapeutically effective amount of a vector of the invention, 
wherein the DNA sequence of interest encodes a protein comprising an immuno- 
genic epitope of an infectious agent. 

The invention further relates to methods for treating an inherited or ac- 
25 quired genetic defect in a subject in need of said treatment, said method compris- 
ing: administering to said subject a therapeutically effective amount of a vector of 
the invention, wherein said DNA sequence of interest encodes a protein which is 
affected by said inherited or acquired genetic defect. 

The invention further relates to methods for expressing a DNA sequence 
30 in a subject, said method comprising administering a vector of the invention to 
said subject. 
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The invention further relates to methods for expressing a DNA sequence 
in a subject, treating an inherited or acquired genetic defect, treating an infectious 
disease, inducing an immune-response to a protein, and providing a protein to a 
subject, wherein the vector of the invention does not encode Bovine Papilloma Vi- 
5 rus protein E1 , and wherein said subject does not express Bovine Papilloma Vi- 
rus protein E1 . 

In certain embodiments, a vector of the invention is used for production of 
a protein encoded by said DNA sequence of interest in a cell or an organism. 
The invention further provides a method for the preparation of a vector of 
10 claim 1 , 2, or 17 comprising: (a) cultivating a host cell containing said vector and 
(b) recovering the vector. In a specific embodiment, the method for preparing a 
vector of the invention further comprises before step (a) a step of transforming 
• said host cell with said vector. In certain specific embodiments, the host cell is a 
prokaryotic cell. In a specific embodiment, the host cell is an Escherichia coli. 
1 5 The invention further relates to a host cell that is characterized by contain- 

ing a vector of the invention. In certain embodiments, the host cell is a bacterial 
cell. In a certain other embodiments, the host cell is a mammalian cell. 

The invention further relates to carrier vectors containing a vector of the 
invention. 

20 The invention further relates to a pharmaceutical composition comprising a 

vector of the invention and a suitable pharmaceutical vehicle. 

The invention further relates to a DNA vaccine containing a vector of the 
invention. 

The invention further relates to a gene therapeutic agent containing a vec- 
25 tor of the invention. 

The invention further relates to a method for the preparation of a DNA 
vaccine, said method comprising combining a vector of the invention with a suit- 
able pharmaceutical vehicle. 

The invention further relates to a method for the preparation of an agent 
30 for use in gene therapy, said method comprising combining a vector of the inven- - 
tion with a suitable pharmaceutical vehicle. 
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4. DESCRIPTION OF THE FIGURES 
Figure 1 shows the schematic map of plasmid super6. 
Figure 2 shows the schematic map of plasmid VI. 
Figure 3 shows the schematic map of plasmid II. 
5 Figure 4 shows the expression of the Nef and E2 proteins from the vectors 

super6, super6wt, VI, Vlwt, and II in Jurkat cells. 

Figure 5 shows the schematic map of plasmid productl. 
Figure 6A shows the schematic map of the plasmids NNV-1 and NNV-2 
andTigure 6B shows the schematic map of plasmid and NNV-2wt. 
10 Figure 7 shows the expression of the Nef protein from the plasmids NNV- 

1, NNV-2, NNV-1 wt, NNV-2-wt, super6, and super6wt in Jurkat cells. 

Figure 8 shows the expression of the Nef and E2 proteins from the plas- 
mids NNV-2-wt, NNV-2-wtFS, and product I in Jurkat cells. 

Figure 9 shows the expression of the Nef and E2 proteins from the plas- 
15 mids NNV-2-wt, NNV-2-wtFS, and product I in P815 cells. 

Figure 10 shows the expression of the Nef and E2 proteins from the plas- 
mids NNV-2-wt, NNV-2-wtFS, and product I in CHO cells. 

Figure 1 1 shows the expression of the Nef protein from the plasmids NNV- 
2-wt, NNV-2-wtFS, and product I in RD cells. 
20 Figure 1 2 shows the expression of the RNA molecules NNV-2wt in CHO u 

Jurkat cells, and P815 cells. 

Figure 1 3 shows the stability of NNV-2wt in bacterial cells. 
Figure 1 4 shows the Southern blot analysis of stability of the NNV-2wt as 
non-replicating episomal element in CHO and Jurkat cell lines. 
25 Figure 1 5 shows that the vectors NNV2wt, NNV2wtFS and productl are 

unable to HPV-1 1 replication factor-dependent replication. 

Figure 16 shows the schematic map of the plasmid 2wtd1EGFP. 
Figure 1 7 shows the schematic map of the plasmid gf 1 0bse2 
Figure 18 shows the schematic map of the plasmid 2wtd1EGFPFS. 
30 Figure 1 9 shows the schematic map of the plasmid NNVdl EGFP. 

Figure 20 shows the growth curves of the Jurkat cells transfected with the 
plasmids 2wtd1EGFP, 2wtd1EGFPFS, NNVdl EGFP or with carrier DNA only. 
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Figure 21 shows the growth curves of the Jurkat cells transfected with the 
. plasmids 2wtd1 EGFP, 2wtd1 EGFPFS, gf1 0bse2 or with carrier DNA only. 

Figure 22 shows the change in the percentage of d1 EGFP positive cells in 
a population of Jurkat cells transfected with the vectors 2wtd1EGFP, 
5 2wtd1 EGFPFS or NNVdl EGFP. 

Figure 23 shows the change in percentage of the d1 EGFP positive cells in 
a population of Jurkat cells transfected with the vectors 2wtd1EGFP, 
2wtd1 EGFPFS orgf10bse2. 

Figure 24 shows the change in the number of d1 EGFP expressing cells in 
10 a population of Jurkat cells transfected with the vectors 2wtd1EGFP, 
2wtd1 EGFPFS or NNVdl EGFP. 

Figure 25 shows the change in the number of d1 EGFP expressing cells in 
a population of Jurkat cells transfected with the vectors 2wtd1EGFP, 
2wtd1 EGFPFS or gf 1 0bse2. 
15 Figure 26. T-cell responses towards recombinant Nef proteins (5 micro- 

grams/well), measured by T-cell proliferation in five patients immunized with 1 mi- 
crogram of GTU-Nef. 

Figure 27. T-cell responses towards recombinant Nef proteins (5 micro- 
grams/well), measured by T-cell proliferation in five patients immunized with 20 
20 micrograms of GTU-Nef. 

Figure 28. T-cell responses towards recombinant Nef proteins (5 micro- 
grams/well), measured by T-cell proliferation in patient # 1 immunized with 1 mi- 
crogram of GTU-Nef. The results are given as stimulation index of the T-cell pro- 
liferation assay (Nef SI) and as IFN-Gamma secretion to the supernatant. 
25 Figure 29. (A) plasmid pEBO LPP; (B) plasmid s6E2d1 EGFP; (C) plasmid 

FRE2d1EGFP 

Figure 30. Plasmid FREBNAdlEGFP 
Figure 31. Vectors did not interfere with cell proliferation 
Figure 32. Vectors were maintained in the cells with different kinetics 
30 Figure 33. Change of the number of dIEGFP expressing cells in time in 

transfected total population of cells 
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Figure 34. Change of the number of d1 EGFP expressing cells in time in 
transfected total population of cells. (A) human embryonic cell line 293; (B) 
" mouse cell line 3T6 

Figure 35. Nef and E2 antibody response 
5 Figure 36. Rev and Tat antibody response 

Figure 37. Gag and CTL response 

Figure 38. (A) GTU-1; (B) GTU-2Nef; (C) GTU-3Nef; (D) su- 
per6wtd1 EGFP; (E) FREBNAdl EGFP; (F) E2BSEBNAd1 EGFP; (G) NNV-Rev 

Figure 39. (A) pNRT; (B) pTRN; (C) pRTN; (D) pTNR; (E) pRNT; (F) 
10 p2TRN; (G) p2RNT; (H) p3RNT; (I) P TRN-iE2-GMCSF; (J) pTRN-iMG-GMCSF 

Figure 40. (A) pMVINTR; (B) pMV2NTR; (C) pMV1N11TR; (D) 
pMV2N11TR 

Figure 41. (A) pCTL; (B) pdgag; (C) psynp17/24; (D) poptp17/24; (E) 
p2mCTL; (F) p2optp17/24; (G) p3mCTL; (H) p3optp17/24 

15 Figure 42. (A) pTRN-CTL; (B) pRNT-CTL; (C) pTRN-dgag; (D) pTRN- 

CTL-dgag; (E) pRNT-CTL-dgag; (F) pTRN-dgag-CTL; (G) pRNT-dgag-CTL; (H) 
P TRN-optp17/24-CTL; (I) P TRN-CTL-optp17/24; (J) P RNT-CTL-optp17/24; (K) 
P 2TRN-optp17/24-CTL; (L) P 2RNT-optp17/24-CTL; (M) P 2TRN-CTL-optp17/24; 
(N) P 2RNT-CTL-optp17/24; (O) P 2TRN-CTL-optp17/24-iE2-mGMCSF; (P) 

20 P 2RNT-CTL-optp1 7/24-iE2-mGMCSF; (Q) P 3TRN-CTL-optp1 7/24; (R) p3RNT- 
CTL-optp17/24; (S) P 3TRN-CTL-optp17/24-iE2-mGMCSF; (T) p3RNT-CTL- 
optpl 7/24-iE2-mGMCSF; (U) FREBNA-RNT-CTL-optp1 7/24; (V) super6wt-RNT- 
CTL-optp17/24; (W) E2BSEBNA- RNT-CTL-optp17/24; (X) pCMV- RNT-CTL- 
0ptp17/24 

25 Figure 43. Analysis of expression of the multireg antigens. 

Figure 44. Analysis of expression of the multireg antigens comprised of 
immunodominant parts of the proteins. 

Figure 45. Analysis of intracellular localization of multireg antigens by im- 
munofluorescence. 

30 Figure 46. Analysis of expression of the gag coded structural proteins and 

the CTL multi-epitope. 

47. The p17/24 protein localization in membranes of RD cells. 
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Figure 48. Analysis expression of the dgag and CTL containing multi- 
genes in Cos-7 cells. 

Figure 49. Western blot analyses of multiHIV antigens expressed in Jurkat 

cells. 

5 Figure 50. Analysis of the expression of the TRN-CTL-optp17/24 and 

RNT-CTL-optp17/24 antigens as well E2 protein from the GTU-1, GTU-2 and 
GTU-3 vector. 

Figure 51 . The maintenance of the multiHIV antigen expression from dif- 
ferent vectors. 

10 Figure 52. Intracellular localization of the multiHIV antigens in RD cells. 

5. DETAILED DESCRIPTION OF THE INVENTION 
5.1 VECTORS OF THE INVENTION 

The present invention is based on the unexpected finding that expression 

15 vectors, which carry (A) an expression cassette of a gene of a nuclear-anchoring 
protein that binds both to (i) a specific DNA sequence and (ii) to a suitable nu- 
clear component and (B) a multimerized DNA binding sequence for said nuclear- 
anchoring protein are capable of spreading in a proliferating cell population. 
Such nuclear-anchoring proteins include, but are not limited to, chromatin- 

20 anchoring proteins, such as the Bovine Papilloma Virus type 1 E2 protein (BPV1 
E2; SEQ ID NO: 50). The DNA binding sequences can be, but are not limited to, 
multimerized E2 binding sites. On the basis of prior art, it could not be expected 
that a segregation/partitioning function of, for instance, the papilloma viruses 
could be expressed separately and that an addition of such segrega- 

25 tion/partitioning function to the vaccine vectors would assure the distribution of 
the vector in the proliferating cell population. Additionally, on the* basis of the 
prior art, it could not have been expected that functional vectors acting independ- 
ently of the replication origin can be constructed. 

The term "nuclear-anchoring protein" as used in the present invention re- 

30 fers to a protein, which binds to a specific DNA sequence and capable of provid- 
ing a nuclear compartmentalization function to the vector, i.e., to a protein, which 
is capable of anchoring or attaching the vector to a specific nuclear compartment. 
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In certain embodiments of the invention, the nuclear-anchoring protein is a natu- 
ral protein. Examples of such nuclear compartments are the mitotic chromatin or 
mitotic chromosomes, the nuclear matrix, nuclear domains like ND10 and POD 
etc. Examples of nuclear-anchoring proteins are the Bovine Papilloma Virus type 

5 1 (BPV1)E2 protein, EBNA1 (Epstein-Barr Virus Nuclear Antigen 1; SEQ ID NO: 
52), and High Mobility Group (HMG) proteins etc. The term "functional equivalent 
of a nuclear-anchoring protein" as used in the present invention refers to a protein 
or a polypeptide of natural or non-natural origin having the properties of the nu- 
clear-anchoring protein. 

10 In certain other embodiments of the invention, the nuclear-anchoring pro- 

tein of the invention is a recombinant protein. In certain specific embodiments of 
the invention, the nuclear-anchoring protein is a fusion protein, a chimeric protein, 
or a protein obtained by molecular modeling. A fusion protein, or a protein ob- 
tained by molecular modeling in connection with the present invention is charac- 

15 terized by its ability to bind to a nuclear component and by its ability to bind se- 
quence-specifically to DNA.. In a preferred embodiment of the invention, such a 
fusion protein is encoded by a vector of the invention which also contains the 
specific DNA sequence to which the fusidn/chimeric protein binds. Nuclear com- 
ponents include, but are not limited to chromatin, the nuclear matrix, the ND10 

20 domain and POD. In order to reduce the risk of interference with the expression 
of genes endogenous to the host cell, the- DNA binding domain and the corre- 
sponding DNA sequence is preferably non-endogenous to the host cell/host or- 
ganism. Such domains include, but are not limited to, the DNA binding domain of 
the Bovine Papilloma Virus type 1 (BPV1) E2 protein (SEQ ID NO: 50), Epstein- 

25 Barr Virus Nuclear Antigen 1 (EBNA1; SEQ ID NO: 52), and High Mobility Group 
(HMG) proteins (HMG box). 

The vector of the invention can further comprise a "DNA sequence of in- 
terest", that encodes a protein (including a peptide or polypeptide), e.g., that is an 
immunogen or a therapeutic. In certain embodiments of the invention, the DNA 

30 sequence of interest encodes a biologically active RNA molecule, such as an an- 
tisense RNA molecule or a ribozyme. 
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The expression vectors of the invention carrying an expression cassette for 
a gene of a nuclear-anchoring protein and multimerized binding sites for said nu- 
clear-anchoring protein spread in a proliferating host cell population. This means 
that a high copy-number of vectors or plasmids are delivered into the target cells 

5 and the use of the segregation/partitioning function of the nuclear-anchoring pro- 
tein and its multimerized binding sites assures the distribution of the vector to the 
daughter cells during cell division. 

The vector of the invention lacks a papilloma virus origin of replication. 
Further, in a preferred embodiment, the vector of the invention lacks an origin of 

10 replication functional in a mammalian cell. The omission of a papilloma virus ori- 
gin of replication or a mammalian origin of replication constitutes an improvement 
over prior art vectors for several reasons. (1) Omission of the origin of replication 
reduces the size of the vector of the invention compared to prior art vectors. 
Such a reduction in size increases the stability of the vector and facilitates uptake 

15 by the host cell. (2) Omission of the origin of replication reduces the risk for re- 
combination with the host cell's genome, thereby reducing the risk of unwanted 
side effects. (3) The omission of the origin of replication allows to control the do- 
sage simply by adjusting the amount of vector administered. In contrast, with a 
functioning origin of replication, replication of the vector has to be taken into con- 

20 sideration when determining the required dosage. (4) If the vector is not adminis- 
tered to a host organism continually, the lack of an origin of replication allows the 
host organism to clear itself of the vector, thus providing more control over the 
levels of DNA sequences to be expressed in the host organism. Further, the abil- 
ity of the organism to clear itself of the vector will be advantageous if the pres- 

25 ence of the vector is required only during the course of a therapy but is undesir- 
able in a healthy individual. 

The gene of a nuclear-anchoring protein useful in the vectors of the pre- 
sent invention can be any suitable DNA sequence encoding a natural or artificial 
protein, such as a recombinant protein, a fusion protein or a protein obtained by 

30 molecular modeling techniques, having the required properties. Thus the gene of 
a natural nuclear-anchoring protein, which contains a DNA binding domain capa- 
ble of binding to a specific DNA sequence and a functional domain capable of 



WO 02/090558 



PCT/FI02/00379 



22 

binding to a nuclear component, can be that of a viral protein, such as the E2 pro- 
tein of Bovine Papilloma Virus or the EBNA1 (Epstein-Barr Virus Nuclear Antigen 
1 ) of the Epstein-Barr Virus, a eukaryotic protein such a one of the High Mobility 
Group (HMG) proteins or a like protein, or a prokaryotic protein. Alternatively, the 

5 gene of a nuclear-anchoring protein, which contains a DNA binding domain ca- 
pable of binding to a specific DNA sequence and a functional domain capable of 
binding to a nuclear component, can also be comprised of DNA sequences, 
which encode a domain from a cellular protein having the ability to attach to a sui- 
table nuclear structure, such as to mitotic chromosomes, the nuclear matrix or 

10 nuclear domains like ND10 or POD. 

Alternatively, the DNA sequence, which encodes a non-natural or artificial 
protein, such as a recombinant protein or a fusion protein or a protein obtained by 
molecular modeling, which contains a DNA binding domain capable of binding to 
a specific DNA sequence of, e.g., a papilloma virus, such as the DNA binding 

15 domain of the E2 protein of the BPV1 , but in which the N-terminus of the nuclear- 
anchoring protein, e.g. that of the E2 protein, has been replaced with domains of 
any suitable protein of similar capacity, for example, with the N-terminal domain 
of Epstein-Barr Virus Nuclear Antigen 1 sequence, can be used. Similarly, DNA 
sequences, which encode a recombinant protein or a fusion protein, which con- 

20 tains a functional domain capable of binding to a nuclear component, e.g., the N- 
terminal functional domain of a papilloma virus, such as the E2 protein of the 
BPV1 , but in which the C-terminal DNA-binding dimerization domain of the nu- 
clear-anchoring protein, e.g., that of the E2 protein, has been replaced with do- 
mains of any protein of a sufficient DNA-binding strength, e.g., the DNA binding 

25 domain of the BPV-1 E2 protein and the EBNA-1 , can be used. 

In a preferred embodiment of the invention, the nuclear-anchoring protein 
is a chromatin-anchoring protein, which contains a DNA binding domain, which 
binds to a specific DNA sequence, and a functional domain capable of binding to 
mitotic chromatin. A preferred example of such a chromatin-anchoring protein 

30 and its multimerized binding sites useful in the present invention are the E2 pro- 
tein of Bovine Papilloma Virus type 1 and E2 protein multimerized binding sites. 
In the case of E2, the mechanism of the spreading function is due to the dual 
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function of the E2 protein: the capacity of the E2 protein to attach to mitotic chro- 
mosomes through the N-terminal domain of the protein and the sequence-specific 
binding capacity of the C-terminai domain of the E2 protein, which assures the 
tethering of vectors, which contain a multimerized E2 binding site, to mitotic 
5 chromosomes. A segregation/partitioning function is thus provided to the vectors. 

In another preferred embodiment of the invention, the expression cassette 
of a gene of the chromatin-anchoring protein comprises a gene of any suitable 
protein of cellular, viral or recombinant origin having analogous properties to E2 
of the BPV1 , i.e., the ability to attach to the mitotic chromatin through one domain 

10 and to cooperatively bind DNA through another domain to multimerized binding 
sites specific for this DNA binding domain. 

In a specific embodiment, sequences obtained from BPV1 , are used in the ' 
vectors of the present invention, they are extensively shortened in size to include 
just two elements from BPV1 . First, they include the E2 protein coding sequence 

15 transcribed from a heterologous eukaryotic promoter and polyadenylated at the 
heterologous polyadenylation site. Second, they include E2 protein multiple bind- 
ing sites incorporated into the vector as a cluster, where the sites can be as head- 
to-tail structures or can be included into the vector by spaced positioning. Both of 
these elements are necessary and, surprisingly, sufficient for the function of the 

20 vectors to spread in proliferating cells. Similarly, when DNA sequences based of 
other suitable sources are used in the vectors of the present invention, the same 
principles are applied. 

According to the present invention, the expression cassette of a gene of a 
nuclear-anchoring protein, which contains a DNA binding domain capable of bind- 

25 ing to a specific DNA sequence and a functional domain capable of binding to a 
nuclear component, such as an expression cassette of a gene of a chromatin- 
anchoring protein, like BPV1 E2, comprises a heterologous eukaryotic promoter, 
the nuclear-anchoring protein coding sequence, such as a chromatin-anchoring 
protein coding sequence, for instance the BPV1 E2 protein coding sequence, and 

30 a poly A site. Different heterologous, eukaryotic promoters, which control the ex- 
pression of the nuclear-anchoring protein, can be used. Nucleotide sequences of 
such heterologous, eukaryotic promoters are well known in the art and are readily 
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available. Such heterologous eukaryotic promoters are of different strength and 
tissue-specificity. In a preferred embodiment, the nuclear anchoring protein is 
expressed at low levels. 

The multimerized DNA binding sequences, i.e., DNA sequences containing 

5 multimeric binding sites, as def ined in the context of the present invention, are the 
region, to which the DNA binding dimerization domain binds. The multimerized 
DNA binding sequences of the vectors of the present invention can contain any 
suitable DNA binding site, provided that it fulfills the above requirements. 

In a preferred embodiment, the multimerized DNA binding sequence of a 

10 vector of the present invention can contain anyone of known 17 different affinity 
E2 binding sites as a hexamer or a higher oligomer, as a octamer or a higher oli- 
gomer, as a decamer or higher oligomer. Oligomers containing different 112 bind- 
ing sites are also applicable. Specifically preferred E2 binding sites useful in the 
vectors of the present invention are the BPV1 high affinity sites 9 and 10, affinity 

15 site 9 being most preferred. When a higher oligomer is concerned, its size is lim- 
ited only by the construction circumstances and it may contain from 6 to 30 iden- 
tical binding sites. Preferred vectors of the invention contain 10 BPV-1 E2 binding 
sites 9 in tandem. When the multimerized DNA binding sequences are comprised 
of different E2 binding sites, their size and composition is limited only by the met- 

20 hod of construction practice. Thus they may contain two or more different E2 bin- 
ding sites attached to a series of 6 to 30, most preferably 10, E2 binding sites. 
The Bovine Papilloma Virus type 1 genome (SEQ ID NO: 49) contains 1 7 E2 pro- 
tein binding sites which differ in their affinity to E2. The E2 binding sites are 
described in Li et al. [Genes Dev 3(4) (1989) 510-526], which is incorporated by 

25 reference in its entirety herein. 

Alternatively, the multimerized DNA binding sequences may be composed 
of any suitable multimeric specific sequences capable of inducing the cooperative 
binding of the protein to the plasmid, such as those of the EBNA1 or a suitable 
HMG protein. 21x30bp repeats of binding sites for EBNA-1 are localized in the 

30 region spanning from nucleotide position 7421 to nucleotide position 8042 of the 
Epstein-Barr virus genome (SEQ ID NO:51 ). These EBNA-1 binding sites are de- 
scribed in the following references: Rawlins et al., Cell 42(3) (1985) 859-868; 
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Reisman et a!., Mo! Cell Biol 5(8) (1985) 1822-1832; and Lupton and Levine, Mol 
Cell Biol 5(1 0) (1 985) 2533-2542, all three of which are incorporated by reference 
in their entireties herein. 

The position of the multimerized DNA binding sequences relative to the 
5 expression cassette for the DNA binding dimerization domain is not critical and 
can be any position in the plasmid. Thus the multimerized DNA binding se- 
quences can be positioned either downstream or upstream relative to the expres- 
sion cassette for the gene of interest, a position close to the promoter of the gene 
of interest being preferred. 

10 The vectors of the invention also contain, where appropriate, a suitable 

promoter for the transcription of the gene or genes or the DNA sequences of in- 
terest, additional regulatory sequences, polyadenylation sequences and introns. 
Preferably the vectors may also include a bacterial plasmid origin of replication 
and one or more genes for selectable markers to facilitate the preparation of the 

15 vector in a bacterial host and a suitable promoter for the expression the gene for 
antibiotic selection. 

The selectable marker can be any suitable marker allowable in DNA vac- 
cines, such a kanamycin or neomycin, and others. In addition, other positive and 
negative selection markers can be included in the vectors of the invention, where 

20 applicable. 

The vectors of the present invention only comprise the DNA sequences, 
for instance BPV1 DNA sequences, which are necessary and sufficient for long- 
term maintenance. All superfluous sequences, which may induce adverse reac- 
tions, such as oncogenic sequences, have been deleted. Thus in preferred vec- 

25 tors of the invention the E2 coding sequence is modified by mutational analysis 
so that this expresses only E2 protein and overlapping E3, E4 and E5 sequences 
have been inactivated by the introduction of mutations, which inactivate the trans- 
lation from Open Reading Frames for E3, E4 and E5. The vector of the invention 
does not contain a papilloma virus origin of replication. Preferably, the vector of 

30 the invention further does not contain an origin of replication functional in a 
. mammalian cell or a mammal. 
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Furthermore, the vectors of the present invention are not host specific, 
since the expression of the nuclear-anchoring protein, such as the E2 protein, is 
controlled by non-native or heterologous promoters. Depending on the particular 
promoter chosen, these promoters may be functional in a broad range of mam- 

5 malian cells or they can be cell or tissue specific. Examples of promoters for the 
nuclear-anchoring protein, such as for the E2 protein, useful in the vectors of the 
present invention are thymidine kinase promoters, Human Cytomegalovirus Im- 
mediate Early Promoter, Rous Sarcoma Virus LTR and like. For the expression 
of the gene of interest, preferred promoters are strong promoters assuring high 

10 levels of expression of the gene of interest, an example for such a promoter is the 
. Human Cytomegalovirus Immediate Early Promoter. 

5.2 THE VECTORS OF THE INVENTION AS VEHICLES FOR EXPRES- 
SION OF A DNA SEQUENCE OF INTEREST 
15 A gene, genes or a DNA sequence or DNA sequences to be expressed via 

a vector of the invention can be any DNA sequence of interest, whose expression 
is desired. Thus the vectors may contain a gene or genes or a DNA sequence or 
DNA sequences from infectious microbial pathogens, such as viruses, against 
which live attenuated vaccines or inactivated vaccines cannot be prepared or 
20 used. Such DNA sequences of interest include genes or DNA sequences from 
viruses, such as Human Immunodeficiency Virus (HIV), Herpex Simplex Virus 
(HSV), Hepatitis C Virus, Influenzae Virus, Enteroviruses etc.; intracellular bacte- 
rial, such as Chlamydia trachomatis, Mycobacterium tuberculosis, Mycoplasma 
pneumonia etc.; extracellular bacteria, such as Salmonella; or fungi, such as 
25 Candida albigans. 

In a preferred embodiment of the invention, the vectors contain a gene en- 
coding early regulatory proteins of HIV, i.e. the nonstructural regulatory proteins 
Nef, Tat or Rev, preferably Nef. In another preferred embodiment of the invention 
the vectors of the invention contain genes encoding structural proteins of the HIV. 
30 In another preferred embodiment the vectors of the present invention contain two 
or more genes encoding any combination of early regulatory proteins and/or 
structural proteins of HIV. Illustrative examples of such combinations are a com- 
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bination of a gene encoding the Net protein and a DNA sequence encoding the 
Tat protein, possibly together with a DNA sequence encoding outer envelope gly- 
coprotein of HIV, gp1 20/gp1 60 or a combination of any immunogenic epitopes of 
the proteins of pathogens incorporated into artificial recombinant protein. 
5 Alternatively, the vectors of the invention may contain genes or DNA se- 

quences for inherited or acquired genetic defects, such as sequences of differen- 
tiation antigens for melanoma, like a Tyrosinase A coding sequence or a coding 
sequence of beta-catenins. 

In a preferred embodiment of the invention, the vectors contain a gene en- 
10 coding proteins relating to cancer or other mutational diseases, preferably dis- 
eases related to immune maturation and regulation of immune response towards 
self and nonself, such as the APECED gene. 

In another preferred embodiment of the invention, the vectors contain any 
DNA sequence coding for a protein that is defective in any hereditary single gene 
15 hereditary disease. 

In another preferred embodiment of the invention, the vectors contain-any 
DNA sequence coding for a macromolecular drug to be delivered and produced 
in vivo. 

The method of the invention for the preparation of the vectors of the inven- 
20 tion comprises the following steps: (A) cultivating a host cell containing a vector 
e of the invention, and (B) recovering the vector. In certain specific embodiments, 
step (A) is preceded by transforming a host cell with a vector of the invention. 

The vectors of the invention are preferably amplified in a suitable bacterial 
host cell, such as Escherichia coli. The vectors of the invention are stable and 
25 replicate at high copy numbers in bacterial cells. If a vector of the invention is to 
be amplified in a bacterial hast ceil, the vector of the invention contains a bacte- 
rial origin of replication. Nucleotide sequences of bacterial origins of replication 
are well known to the skilled artisan and can readily be obtained. 

Upon transfection into a mammalian host in high copy number, the vector 
30 spreads along with cell divisions and the number of cells carrying the vector in- 
creases without the replication of the vector, each cell being capable of express- 
ing the protein of interest. 
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The vectors of the invention result in high expression of the desired pro- 
tein. For instance, as demonstrated in Examples 4, 7 - 10: a high expression of 
the Nef protein of the HIV, green fluorescent protein (EGFP) and the AIRE protein 
could be demonstrated in many different cell lines and the data indicate that not 
5 only the number of positive cells, but the quantity of the protein encoded by the 
gene of interest is increasing in time. 

The vectors of the invention also induce both humoral and cellular re- 
sponse as demonstrated in Examples 9 and 10. The results indicate that the vec- 
tors of the present invention can effectively be used as DNA vaccines. 
10 The'vaccines of the present invention contain a vector of the present in- 

vention or a mixture of said vectors in a suitable pharmaceutical carrier. The vac- 
cine may for instance contain a mixture of vectors containing genes for the three 
different regulatory proteins of the HIV and/or structural proteins of the HIV. 

The vaccines of the invention are formulated using standard methods of 
15 vaccine formulation to produce vaccines to be administered by any conventional 
route of administration, i.e. intramuscularily, intradermal^ and like. 

The vectors of the invention may contain the ISS stimulatory sequences in 
order to activate the immune response of the body. 

The vaccines of the invention can be used in a conventional preventive 
20 manner to protect an individual from infections, Alternatively, the vaccines of the 
invention can be used as therapeutical vaccines, especially in the case of viral in- 
fections, together with a conventional medication. 

As mentioned above, the vectors of the present invention carrying the 
mechanism of spreading in the host cell find numerous applications as vaccines, 
25 in gene therapy, in gene transfer and as therapeutic immunogens. The vectors of 
the invention can be used to deliver a normal gene to a host having a gene de- 
fect, thus leading to a cure or therapy of a genetic disease. Furthermore, the vec- 
tors can deliver genes of immunogenic proteins of foreign origin, such as those 
from microbes or autologous tumor antigens, to be used in the development of 
30 vaccines against microbes or cancer. Furthermore, the vectors of the invention 
can deliver suitable genes of marker substances to nucleus, to be used in studies 
of cellular function or in diagnostics. Finally, the vectors of the invention can be 
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used to specifically deliver a gene of macromolecular drug to the nucleus, thus 
enabling the development of novel therapeutic principles to treat and cure dis- 
eases, where the expression of the drug in the site of action, the cell nucleus, is 
of importance. These drugs can be chemical macromolecules, such as any pro- 
5 teins or polypeptides with therapeutic or curative effect, which interfere with any 
of the nuclear mechanisms, such as the replication or transcription or the trans- 
port of substances to and from the nucleus. 

Specifically, the vectors of the present invention can be used for the ex- 
pression of the specific cytokines, like interleukines (IL1, IL2, IL4, IL6, IL12 and 

1 o others) or interferon, with the aim of modulating the specific immune responses of 
the organism (immunotherapy) against foreign antigens or boosting of the activity 
of the immune system against the mutated self-antigens. The vectors of the pre- 
sent invention are also useful in complementing malfunctioning of the brain due 
to the loss of specific dopamine-ergic neurons leading to the irreversible neu- 

1 5 rodegeneration, which is cause for Parkinson's disease, by expressing genes in- 
volved into synthesis of dopamine, like tyrosine hydroxylase, as well as other 
genes deficiency of which would have the similar effect. The vectors of the pre- 
sent invention are also useful for the expression of proteins and peptides regulat- 
ing the brain activity, like dopamine receptors, CCK-A and CCK-B receptors, as 

20 well as neurotrophic factors, like GDNF, BDNF and other proteins regulating the 
brain activity. Further, the vectors of the present invention are useful for a long- 
term expression of factor IX in hepatocytes and alfal -antitrypsin in muscle cells 
with the aim of complementing respective deficiencies of the organism. 

25 5.3 TARGET DISEASES AND DISORDERS 

In certain embodiments, a vector of the invention is used as a vaccine. In 
certain embodiments, a vector of the invention contains a DNA sequence of in- 
terest that encodes a protein or a peptide. Upon administering of such a vector to 
a subject, the protein or peptide encoded by the DNA sequence of interest is ex- 

30 pressed and stimulates an immune response specific to the protein or peptide 
encoded by the DNA sequence of interest. 
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In specific embodiments, the vector of the invention is used to treat and/or 
prevent an infectious disease and/or a condition caused by an infectious agent. 
Such diseases and conditions include, but are not limited to, infectious diseases 
caused by bacteria, viruses, fungi, protozoa, helminths, and the like. In a more 
5 specific embodiment of the invention, the infectious disease is Acquired Immuno- 
deficiency Syndrome. 

Preferably, where it is desired to treat or prevent viral diseases, DNA se- 
quences encoding molecules comprising epitopes of known viruses are used. 
For example, such DNA sequences encoding antigenic epitopes may be pre- 
10 pared from viruses including, but not limited to, hepatitis type A, hepatitis type B, 
hepatitis type C, influenza, varicella, adenovirus, herpes simplex type I (HSV-I), 
herpes simplex type II (HSV-II), rinderpest, rhinovirus, echovirus/rotavirus, respi- 
ratory syncytial virus, papilloma virus, papova virus, cytomegalovirus, echinovirus, 
arbovirus, huntavirus, coxsackie virus, mumps virus, measles virus, rubella virus, 
15 polio virus, human immunodeficiency virus type I (HIV-I), and human immunode- 
ficiency virus type II (HIV-II). 

Preferably, where it is desired to treat or prevent bacterial infections, DNA 
sequences encoding molecules comprising epitopes of known bacteria are used. 
For example, such DNA sequences encoding antigenic epitopes may be pre- 
20 pared from bacteria including, but not limited to, mycobacteria rickettsia, 
mycoplasma, neisseria and legionella. 

Preferably, where it is desired to treat or prevent protozoal infections, DNA 
sequences encoding molecules comprising epitopes of known protozoa are used. 
For example, such DNA sequences encoding antigenic epitopes may be pre- 
25 pared from protozoa including, but not limited to, leishmania, kokzidioa, and try- 
panosoma. 

Preferably, where it is desired to treat or prevent parasitic infections, DNA 
sequences encoding molecules comprising epitopes of known parasites are 
used. For example, such DNA sequences encoding antigenic epitopes may be 
30 prepared from parasites including, but not limited to, chlamydia and rickettsia. 

In other specific embodiments, the vector of the invention is used to treat 
and/or prevent a neoplastic disease in a subject. In these embodiments, the DNA 
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sequence of interest encodes a protein or peptide that is specific to or associated 
with the neoplastic disease. By way of non-limiting example, the neoplastic dis- 
ease can be a fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, os- 
teogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, lymphangio- 
5 sarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tu- 
mor, leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic cancer, 
breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal 
eel! carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carci- 
noma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, 

10 medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, 
bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms 1 
tumor, cervical cancer, testicular tumor, lung carcinoma, small cell lung carci- 
noma, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medul- 
loblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, 

15 acoustic neuroma, oligodendroglioma, meningioma, melanoma, neuroblastoma, 
retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute myelo- 
cytic leukemia (myeloblasts, promyelocyte, myelomonocytic, monocytic and 
erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia 
and chronic lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's 

20 disease and non-Hodgkin's disease), multiple myeloma, Waldenstrom's 
macroglobulinemia, and heavy chain disease, etc. 

In certain other embodiments of the invention, the DNA sequence of inter- 
est encodes a protein that is non-functional or malfunctioning due to an inherited 
disorder or an acquired mutation in the gene encoding the protein. Such genetic 

25 diseases include, but are not limited to, metabolic diseases, e.g., Atherosclerosis 
(affected gene: APOE); cancer, e.g., Familial Adenomatous Polyposis Coli (af- 
fected gene: APC gene); auto-immune diseases, e.g., autoimmune polyendocri- 
nopathy-candidosis-ectodermal dysplasia (affected gene: APECED); disorders of 
the muscle, e.g., Duchenne muscular dystrophyvaccines (affected gene: DMD); 

30 diseases of the nervous system, e.g., Alzheimer's Disease (affected genes: PS1 
andPS2). 
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In even'other embodiments, the vectors of the invention are used to treat 
and/or prevent diseases and disorders caused by pathologically high activity of a 
protein. In these embodiments of the invention, the DNA sequence of interest 
encodes an antagonist of the overactive protein. Such antagonists include, but 

5 are not limited to, antisense RNA molecules, ribozymes, antibodies, and domi- 
nant negative proteins. In specific embodiments of the invention, the DNA se- 
quence of interest encodes an inhibitor of an oncogene. 

In certain embodiments, the DNA sequence of interest encodes a mole- 
cule that antagonizes neoplastic growth. In specific embodiments of the inven- 

10 tion, the DNA sequence of interest encodes a tumor suppressor, such as, but not 
limited to, p53. In other specific embodiments, the DNA sequence of interest 
encodes an activator of apoptosis, such as but not limited to, a Caspase. 

The invention provides methods, whereby a DNA sequence of interest is 
expressed in a subject. In certain embodiments, a vector containing one or more 

15 expression cassettes of a DNA sequence of interest is administered to the sub- 
ject, wherein the subject does not express the Bovine Papillomavirus E1 protein. 

5.4 THERAPEUTIC METHODS FOR USE WITH THE INVENTION 
20 5.4.1 RECOMBINANT DNA 

In various embodiments of the invention, the vector of the invention com- 
prises one or more expression cassettes comprising a DNA sequence of interest. . 
The DNA sequence of interest can encode a protein and/or a biologically active 
RNA molecule. In either case, the DNA sequence is inserted into the vector of 
25 the invention for expression in reconibinant cells or in cells of the host in the case 
of gene therapy. 

An expression cassette, as used herein, refers to a DNA sequence of in- 
terest operably linked to one or more regulatory regions or enhancer/promoter 
sequences which enables expression of the protein of the invention in an appro- 
30 priate host cell. "Operably-linked" refers to an association in which the regulatory 
regions and the DNA sequence to be expressed are joined and positioned in 
such a way as to permit transcription, and in the case of a protein, translation. 
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The regulatory regions necessary for transcription of the DNA sequence of 
interest can be provided by the vector of the invention. In a compatible host- 
construct system, cellular transcriptional factors, such as RNA polymerase, will 
bind to the regulatory regions of the vector to effect transcription of the DNA se- 
5 quence of interest in the host organism. The precise nature of the regulatory re- 
gions needed for gene expression may vary from host cell to host cell. Generally, 
a promoter is required which is capable of binding RNA polymerase and promot- 
ing the transcription of an operably-associated DNA sequence. Such regulatory 
regions may include those 5-non-coding sequences involved with initiation of 

10 transcription and translation, such as the TATA box, capping sequence, CAAT 
sequence, and the like. The non-coding region 3' to the coding sequence may 
contain transcriptional termination regulatory sequences, such as terminators and 
polyadenylation sites. 

Both constitutive and inducible regulatory regions may be used for expres- 

15 sion of the DNA sequence of interest It may be desirable to use inducible pro- 
moters when the conditions optimal for growth of the host cells and the conditions 
for high level expression of the DNA sequence of interest are different. Examples 
of useful regulatory regions are provided below (section 5.4.4). 

In order to attach DNA sequences with regulatory functions, such as pro- 

20 moters, to the DNA sequence of interest or to insert the DNA sequence of interest 
into the cloning site of a vector, linkers or adapters providing the appropriate 
compatible restriction sites may be ligated to the ends of the cDNAs by tech- 
niques well known in the art [Wu et al., Methods in Enzymol 152 (1987) 343- 
349). Cleavage with a restriction enzyme can be followed by modification to cre- 

25 ate blunt ends by digesting back or filling in single-stranded DNA termini before 
ligation. Alternatively, a desired restriction enzyme site can be introduced into a 
fragment of DNA by amplification of the DNA by use of PCR with primers contain- 
ing the desired restriction enzyme site. 

The vector comprising a DNA sequence of interest operably linked to a 

30 regulatory region (enhancer/promoter sequences) can be directly introduced into 
appropriate host cells for expression of the DNA sequence of interest without fur- 
ther cloning. 
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For expression of the DNA sequence of interest in mammalian host cells, a 
variety of regulatory regions can be used, for example, the SV40 early and late 
promoters, the cytomegalovirus (CMV) immediate early promoter, and the Rous 
sarcoma virus long terminal repeat (RSV-LTR) promoter. Inducible promoters 

5 that may be useful in mammalian cells include but are not limited to those associ- 
ated with the metallothionein II gene, mouse mammary tumor virus glucocorticoid 
responsive long terminal repeats (MMTV-LTR), (3-interferon gene, and hsp70 
gene [Williams et al., Cancer Res. 49 (1989) 2735-42; Taylor et al., Mol. Cell 
Biol., 10 (1 990) 1 65-75]. It may be advantageous to use heat shock promoters or 

10 stress promoters to drive expression of the DNA sequence of interest in recombi- 
nant host cells. 

In addition, the expression vector may contain a selectable or screenable 
marker gene for initially isolating, identifying or tracking host cells that contain the 
vector. A number of selection systems may be used for mammalian cells, includ- 

15 ing but not limited to the Herpes simplex virus thymidine kinase [Wigler et al., Cell 
1 1 (1977) 223], hypoxanthine-guanine phosphoribosyltransferase [Szybalski and 
Szybalski, Proc. Natl. Acad. Sci. USA 48 (1962) 2026], and adenine phosphori- 
bosyltransferase [Lowy et al., Cell 22 (1980) 817] genes can be employed in tk", 
hgprf or aprf cells, respectively. Also, antimetabolite resistance can be used as 

20 the basis of selection for dihydrofolate reductase (dhfr), which confers resistance 
to methotrexate [Wigler et al., Natl. Acad. Sci. USA 77 (1 980) 3567; O'Hare et al., 
Proc. Natl. Acad. Sci. USA 78 (1 981) 1 527]; gpt, which confers resistance to my- 
cophenoiic acid [Mulligan & Berg, Proc. Natl. Acad. Sci. USA 78 (1981) 2072]; 
neomycin phosphotransferase (neo), which confers resistance to the aminoglyco- 

25 side G-418 [Colberre-Garapin et al., J. Mol. Biol. 150 (1981) 1]; and hygromycin 
phosphotransferase (hyg), which confers resistance to hygromycin [Santerre et 
al., 1984, Gene 30 (1984)147]. Other selectable markers, such as but not limited 
to histidinol and Zeocin® can also be used. 

30 5.4.2 EXPRESSION SYSTEMS AND HOST CELLS 

For use with the methods of the invention, the host cell and/or the host or- 
ganism preferably does not express the Bovine Papilloma Virus E1 protein. Fur- 
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thermore, preferably the vector of the invention does not encode the Bovine 
Papilloma Virus E1 protein. 

Preferred mammalian host cells include but are not limited to those derived 
from humans, monkeys and rodents, (see, for example, Kriegler M. in. ; "Gene 
5 Transfer and Expression: A Laboratory Manual", New York, Freeman & Co. 
1 990), such as monkey kidney cell line transformed by SV40 (COS-7, ATCC CRL 
1 651 ); human embryonic kidney line (293, 293-EBNA, or 293 cells subcloped for 
growth in suspension culture, Graham et al M J. Gen. Virol., 36 (1977) 59; baby 
hamster kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary-cells-DHFR 

10 [CHO, Urlaub and Chasin. Proc. Natl. Acad. Sci. 77 (1980) 4216]; mouse Sertoli 
cells [Mather, Biol. Reprod. 23 (1980)243-251]; mouse fibroblast cells (NIH-3T3), 
monkey kidney cells (CVI ATCC CCL 70); african green monkey kidney cells 
(VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC 
CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 

15 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver 
cells (Hep G2, HB 8065); and mouse mammary tumor cells (MMT 060562, ATCC 
CCL51). 

The vectors of the invention may be synthesized and assembled from 
known DNA sequences by well-known techniques in the art. The regulatory re- 

20 gionsand enhancer elements can be of a variety of origins, both natural and syn- 
thetic. Some host cells may be obtained commercially. 

The vectors of the invention containing a DNA sequence of interest can be 
introduced into the host cell by a variety of techniques known in the art, including 
but not limited to, for prokaryotic cells, bacterial transformation (Hanahan, 1985, 

25 in DNA Cloning, A Practical Approach, 1:109-136), and for eukaryotic cells, cal- 
cium phosphate mediated transfection [Wigler et al., Cell 1 1 (1977) 223-232], li- 
posome-mediated transfection [Schaefer-Ridder et al„ Science 215 (1982) 166- 
168], electroporation [Wolff etal., Proc Natl Acad Sci 84 (1987)3344], and micro- 
injection [Cappechi, Cell 22 (1980)479-4889]. 

30 In a specific embodiment, cell lines that express the DNA sequence of the 

invention may be engineered by using a vector that contains a selectable marker. 
By way of example but not limitation, following the introduction of the vector, en- 
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gineered cells may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. The selectable marker in the vector con- 
fers resistance to the selection and optimally allows only cells that contain the 
vector with the selectable marker to grow in culture. 

5 

5.4.3 VACCINE APPROACHES 

In certain embodiments, a vector of the invention comprising an expression 
cassette of a DNA sequence of interest is administered to a subject to induce an 
immune response. Specifically, the DNA sequence of interest encodes a protein 
10 (for example, a peptide or polypeptide), which induces a specific immune re- 
sponse upon its expression. Examples of such proteins are discussed in section 
5.3. 

For the delivery of a vector of the invention for use as a vaccine, methods 
may be selected from among those known in the art and/or described in section 
15 5.4.6. . 

5.4.4 GENE THERAPY APPROACHES 

In a specific embodiment, a vector of the invention comprising an expres- 
sion cassette comprising DNA sequences of interest is administered to treat, or 

20 prevent various diseases. The DNA sequence of interest may encode a protein 
and/or a biologically active RNA molecule. Gene therapy refers to therapy per- 
formed by the administration to a subject of an expressed or expressible DNA 
sequence. In this embodiment of the invention, the DNA sequences produce 
their encoded protein or RNA molecule that mediates a therapeutic effect. 

25 Any of the methods for gene therapy available in the art can be used ac- 

cording to the present invention. Exemplary methods are described below. 

For general reviews of the method of gene therapy, see, Goldspiel et a/., 
Clinical Pharmacy 12 (1993)488-505; Wu and Wu, Biotherapy 3 (1991) 87-95; 
Tolstoshev, Ann. Rev. Pharmacol. Toxicol. 32 (1 993) 573-596; Mulligan, Science 

30 260 (1993)926-932; Morgan and Anderson, Ann. Rev. Biochem. 62 (1993) 191- 
21 7; May, TIBTECH 1 , l(5) (1993)155-21 5. Methods commonly known in the art 
of recombinant DNA technology which can be used are described in Ausubel et 
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al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, NY (1993); 
and Kriegler, Gene Transfer and Expression, A Laboratory Manual, Stockton 
Press, NY (1990). 

The following animal regulatory regions, which exhibit tissue specificity and 
5 have been utilized in transgenic animals, can be used for expression of the DNA 
sequence of interest in a particular tissue type: elastase I gene control region 
which is active in pancreatic acinar cells [Swift et al., Cell 38 (1984) 639-646; Or- 
nitzet al., Cold Spring Harbor Symp. Quant. Biol. 50 (1986) 399-409; MacDonald, 
Hepatology 7 (1 987) 425-51 5]; insulin gene control region which is active in pan- 

10 creatic beta cells [Hanahan, Nature 315 (1985)115-122], immunoglobulin gene 
control region which is active in lymphoid cells [Grosschedl et al., Cell 38 (1984) 
647-658; Adames et al., Nature 318 (1985) 533-538; Alexander et al., Mol. Cell. 
Biol. 7 (1987) 1436-1444], mouse mammary tumor virus control region which is 
active in testicular, breast, lymphoid and mast cells [Leder et al., Cell 45 (1986) 

15 485-495], albumin gene control region which is active in the liver [Pinkert et al., 
Genes and Devel. 1 (1987) 268-276], alpha-fetoprotein gene control region which 
is active in the liver [Krumlauf et al., Mol. Cell. Biol. 5 (1985)1639-1648; Hammer 
et al., Science 235 (1 987) 53-58; alpha 1 -antitrypsin gene control region which is 
active in the liver [Kelsey etal., Genes and Devel. 1 (1987) 161-171], beta-globin 

20 gene control region which is active in myeloid cells [Mogram et al., Nature 315 
(1985) 338-340; Kollias et al., Cell 46 (1986) 89-94]; myelin basic protein gene 
control region which is active in oligodendrocyte cells in the brain [Readhead et 
al., Cell 48 (1987) 703-712]; myosin light chain-2 gene control region which is ac- 
tive in skeletal muscle [Sani, Nature 314 (1985) 283-286], and gonadotropic re- 

25 leasing hormone gene control region which is active in the hypothalamus [Mason 
et al., Science 234 (1986)1372-1378]. 

Methods of delivery fro gene therapy approaches are well known in the art 
and/or described in section 5.4.6. 

30 5.4.5 INHIBITORY ANTISENSE AND RIBOZYME 

In certain embodiments of the invention a vector of the. invention contains 
a DNA sequence of interest that encodes an antisense or ribozyme RNA mole- 
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cule. Techniques for the production and use of such molecules are well known to 
those of skill in the art. 

Antisense RNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. Antisepse ap- 

5 proaches involve the design of oligonucleotides that are complementary to a tar- 
get gene mRNA. The antisense oligonucleotides will bind to the complementary 
target gene mRNA transcripts and prevent translation. Absolute complementar- 
ity, although preferred, is not required. 

A sequence "complementary" to a portion of an RNA, as referred to herein, 

10 means a sequence having sufficient complementarity to be able to hybridize with 
at least the non-polyA portion of an RNA, forming a stable duplex; in the case of 
double-stranded antisense nucleic acids, a single strand of the duplexDNA may 
thus be tested, or triplex formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense 

15 nucleic acid. Generally, the longer the hybridizing nucleic acid ! , the more base 
mismatches with an RNA it may contain and still form a stable duplex (or triplex, 
as the case may be). One skilled in the art can ascertain a tolerable degree of 
mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

20 Antisense nucleic acids shouldbe at least six nucleotides in length, and 

are preferably oligonucleotides ranging from 6 to about 50 nucleotides in length. 
In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 nu- 
cleotides, at least 25 nucleotides or at least 50 nucleotides. In other embodi- 
ments of the invention, the antisense nucleic acids are at least 100, at least 250, 

25 at least 500, and at least 1000 nucleotides in length. 

Regardless of the choice of target sequence, it is preferred that in vitro 
studies are first performed to quantitate the ability of the antisense oligonucleo- 
tide to inhibit gene expression. It is preferred that these studies utilize controls 
that distinguish between antisense gene inhibition and nonspecific biological ef- 

30 fects of oligonucleotides. It is also preferred that these studies compare levels of 
the target RNA or protein with that of an internal control RNA or protein. Addi- 
tionally, it is envisioned that results obtained using the antisense DNA sequence 
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are compared with those obtained using a control DNA sequence. It is preferred 
that the control DNA sequence is of approximately the same length as the test 
oligonucleotide and that the DNA sequence of the oligonucleotide differs from the 
antisense sequence no more than is necessary to prevent specific hybridization 
5 to the target sequence. 

While antisense DNA sequences complementary to the target gene coding 
region sequence could be used, those complementary to the transcribed, 
untranslated region are most preferred. 

For expression of the biologically active RNA, e.g., an antisense RNA 

10 molecule, from the vector of the invention the DNA sequence encoding the bio- 
logically active RNA molecule is operatively linked to a strong pol ill or pol II pro- 
moter. The use of such a construct to transfect target cells in the patient will re- 
sult in the transcription of sufficient amounts of single stranded RNAs thafcwill 
form complementary base pairs with the endogenous target gene transcripts and 

15 thereby prevent translation of the target gene mRNA. For example, a vector of 
the invention can be introduced, e.g., such that it is taken up by a cell and directs 
the transcription of an antisense RNA. Such vectors can be constructed by re- 
combinant DNA technology methods standard in the art. Expression of the se- 
quence encoding the antisense RNA can be by any promoter known in the art to 

20 act in mammalian, preferably human cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: the SV40 early pro- 
moter region [Bernoist and Chambon, Nature 290 (1 981 ) 304-31 0], the promoter 
contained in the 3 long terminal repeat of Rous sarcoma virus [Yamamoto, et al., 
Cell 22 (1980) 787-797], the herpes thymidine kinase promoter [Wagner, et al., 

25 Proc. Natl. Acad. Sci. U.S.A. 78 (1981) 1441-1445], the regulatory sequences of 
the metallothionein gene [Brinster, et al., 1982, Nature 296 (1982) 39-42], etc. 

In certain embodiments of the invention, a vector of the invention contains 
a DNA sequence, which encodes a ribozyme. Ribozyme molecules designed to 
catalytically cleave target gene mRNA transcripts can also be used to prevent 

30 translation of a target gene mRNA and, therefore, expression of a target gene 
product [see, e.g., PCT international Publication WO90/1 1364, published October 
4, 1990; Sarver, et al., Science 247 (1990) 1222-1225]. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the spe- 
cific cleavage of RNA. [For a review, see Rossi, Current Biology 4 (1994) 469- 
471]. The mechanism of ribozyme action involves sequence specific hybridiza- 
tion of the ribozyme molecule to complementary target RNA, followed by an en- 
5 donucleolytic cleavage event. The composition of ribozyme molecules must in- 
clude one or more sequences complementary to the target gene mRNA, and 
must include the well known catalytic sequence responsible for mRNA cleavage. 
For this sequence, see, e.g., U.S. Patent No. 5,093,246, which is incorporated 
herein by reference in its entirety. 
10 While ribozymes that cleave mRNA at site-specific recognition sequences 

can be used to destroy target gene mRNAs, the use of hammerhead ribozymes is 
preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by flank- 
ing regions that form complementary base pairs with the target mRNA. The sole 
requirement is that the target mRNA have the following sequence of two bases: 
15 5-UG-3'. The construction and production of hammerhead ribozymes is well 
known in the art and is described more fully in Myers, 1995, Molecular Biology 
and Biotechnology: A Comprehensive Desk Reference, VCH Publishers, New 
York, (see especially Figure 4, page 833) and in Haseloff & Gerlach, Nature, 334 
1988) 585-591, which is incorporated herein by reference in its entirety. 
20 Preferably the ribozyme is engineered so that the cleavage recognition site 

is located near the 5' end of the target gene mRNA, l.e. 9 to increase efficiency 
and minimize the intracellular accumulation of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucle- 
ases (hereinafter "Cech-type ribozymes") such as the one that occurs naturally in 
25 Tetrahymena thermophila (known as the IVS, or L-19 IVS RNA) and that has 
been extensively described by Thomas Cech and collaborators [Zaug, et al., Sci- 
ence, 224 (1984) 574-578; Zaug and Cech, Science, 231 (1986)470-475; Zaug, 
et al., Nature, 324 (1 986) 429-433; published International patent application No. 
WO 88/04300 by University Patents Inc.; Been & Cech, Cell, 47 (1986)207-216]. 
30 The Cech-type ribozymes have an eight base pair active site, which hybridizes to 
a target RNA sequence whereafter cleavage of the target RNA takes place. The 
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invention encompasses those Cech-type ribozymes, which target eight base-pair 
active site sequences that are present in the target gene. 

Expression of a ribozyme can be under the control of a strong constitutive 
pol III or pol II promoter, so that transfected cells will produce sufficient quantities 
5 of the ribozyme to destroy endogenous target gene messages and inhibit transla- 
tion. Because ribozymes unlike antisense molecules, are catalytic, a lower intra- 
cellular concentration is required for efficiency. 

In instances wherein the antisense and/or ribozyme molecules described 
herein are utilized to inhibit mutant gene expression, it is possible that the tech- 

10 nique may so efficiently reduce or inhibit the translation of mRNA produced by 
normal target gene alleles that the possibility may arise wherein the concentration, 
of normal target gene product present may be lower than is necessary for a nor- 
mal phenotype. In such cases, to ensure that substantially normal levels of target 
gene activity are maintained, therefore, nucleic acid molecules that encode and 

15 express target gene polypeptides exhibiting normal target gene activity may; be 
introduced into cells via gene therapy methods such as those described, below, in 
Section 5.4.4 that do not contain sequences susceptible to whatever antisense, 
ribozyme, or triple helix treatments are being utilized. Alternatively, in instances 
whereby the target gene encodes an extracellular protein, it may be preferable to 

20 co-administer normal target gene protein in order to maintain the requisite level of 
target gene activity. 

Methods of administering the ribozyme and antisense RNA molecules are 
well known in the art and/or described in section 5.4.6. 

25 5.4.6 PHARMACEUTICAL FORMULATIONS AND MODES OF 

ADMINISTRATION 

In a preferred aspect, a pharmaceutical of the invention comprises a sub- 
stantially purified vector of the invention (e.g., substantially free from substances 
that limit its effect or produce undesired side-effects). The subject to whom the 
30 pharmaceutical is administered in the methods of the invention is preferably an 
animal, including but not limited to animals such as cows, pigs, horses, chickens, 
cats, dogs, efc, and is preferably a mammal, and most preferably a human. 
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In certain embodiments, the vector of the invention is directly administered 
in vivo, where the DNA sequence of interest is expressed to produce the encoded 
product. This can be accomplished by any of numerous methods known in the 
art. The vectors of the invention can be administered so that the nucleic acid se- 
5 quences become intracellular. The vectors of the invention can be administered 
by direct injection of naked DNA; use of microparticle bombardment (e.g., a gene 
gun; Biolistic, Dupont); coating with lipids or cell-surface receptors ortransfecting 
agents; encapsulation in microparticles or microcapsules; administration in link- 
age to a peptide which is known to enter the nucleus; administration in linkage to 
10 a ligand subject to receptor-mediated endocytosis [see, e.g., Wu and Wu, J. Biol. 
Chem. 262 (1 987) 4429-4432] (which can be used to target cell types specifically 
expressing the receptors); etc. In a specific embodiment, the compound or com- 
position can be delivered in a vesicle, in particular a liposome [see Langer, Sci- 
ence 249 (1990) 1527-1533; Treat ef a/., 1989, in Liposomes in the Therapy of 
15 Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New 
York, pp. 353- 365; Lopez-Berestein, ibid., pp. 317-327]. 

In certain embodiments, the vector of the invention is coated with lipids or 
cell-surface receptors or transfecting agents, or linked to a homeobox- like pep- 
tide which is known to enter the nucleus [see e.g., Joliot et a/., Proc. Natl. Acad. 
20 Sci. USA 88 (1991) 1864-1868], etc. 

In certain other embodiments, nucleic acid-ligand complexes can be 
formed in which the ligand comprises a fusogenic viral peptide to disrupt en- 
dosomes, allowing the nucleic acid to avoid lysosomal degradation. 

In yet other embodiments, the vector of the invention can be targeted in 
25 vivo for cell specific uptake and expression, by targeting a specific receptor (see, 
e.g., PCT Publications WO 92/06 180; WO 92/22635; W092/20316; 
W093/14188, and WO 93/20221), . 

Methods for use with the invention include, but are not limited to, intrader- 
mal, intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidu- 
30 ral, and oral routes. Methods for use with the invention further include admini- 
stration by any convenient route, for example by infusion or bolus injection, by 
absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal 
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and intestinal mucosa, etc.). In a specific embodiment, it may be desirable to 
administer a vector of the invention by injection, by means of a catheter, by 
means of a suppository, or by means of an implant, said implant being of a po- 
rous, non-porous, or gelatinous material, including a membrane, such as a sialas- 
5 tic membrane, or a fiber. Care must be taken to use materials to which the vector 
does not absorb. Administration can be systemic or local. 

In certain embodiments, a vector of the invention is administered together 
with other biologically active agents such as chemotherapeutic agents or agents 
that augment the immune system. 

10 In yet another embodiment, methods for use with the invention include de- 

livery via a controlled release system. In one embodiment, a pump may be used 
[see Langer, supra; Sefton, CRC Crit. Ref. Biomed. Eng. 14 (1989) 201; Buch- 
wald etal., Surgery 88 (1980) 507; Saudek et a/., N, Engl. J. Med. 321 (1989) 
574]. In another embodiment, polymeric materials can be used [see Medical Ap- 

15 plications of Controlled Release, 1974, Langer and Wise (eds.), CRC Pres., Boca 
Raton, Florida; Controlled Drug Bioavailability, Drug Product Design and Per- 
formance, 1984, Smolen and Ball (eds.), Wiley, New York; Ranger and Peppas, 
Macromol. Sci. Rev. Macromol. Chem. 23 (1983) 61; see also Levy et a/., Sci- 
ence 228 (1985) 190; During etal., Ann. Neurol. 25 (1989) 351; Howard et a/., J. 

20 Neurosurg. 71 (1989) 105]. 

Other controlled release systems are discussed in the review by Langer, 
Science 249 (1990)1527-1533. 

Pharmaceutical compositions of the invention comprise a therapeutically 
effective amount of a vector of the invention, and a suitable pharmaceutical vehi- 

25 cle. In a specific embodiment, the term "suitable pharmaceutical vehicle" means 
approved by a regulatory agency of the Federal or a state government or listed in 
the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in 
animals, and more particularly in humans. The term "vehicle" refers to a diluent, 
adjuvant, excipient, or vehicle with which the therapeutic is administered. Such 

30 suitable pharmaceutical vehicles can be sterile liquids, such as water and oils, in- 
cluding those of petroleum, animal, vegetable or synthetic origin, such as peanut 
oil, soybean oil, mineral oil, sesame oil and the like. Water is a preferred carrier 
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when the pharmaceutical composition is administered intravenously. Saline solu- 
tions and aqueous dextrose and glycerol solutions can also be employed as liquid 
carriers, particularly for injectable solutions. Suitable pharmaceutical excipients 
include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, 

5 sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propylene, glycol, water, ethanol and the like. The composition, if de- 
sired, can also contain minor amounts of wetting or emulsifying agents, or pH 
buffering agents. These compositions can take the form of solutions, suspen- 
sions, emulsion, tablets, pills, capsules, powders, sustained-release formulations 

10 and the like. The composition can be formulated as a suppository, with traditional 
binders and carriers such as triglycerides. Oral formulation can include standard 
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium 
stearate, sodium saccharine, cellulose, magnesium carbonate, etc. Examples of 
suitable pharmaceutical carriers are described in "Remington's Pharmaceutical 

15 Sciences" by E.W. Martin. Such compositions will contain a therapeutically ef- 
fective amount of the nucleic acid or protein of the invention, preferably in purified 
form, together with a suitable amount of carrier so as to provide the form for 
proper administration to the patient. The formulation should suit the mode of ad- 
ministration. 

20 in a specific embodiment, the pharmaceutical of the invention is formu- 

lated in accordance with routine procedures as a pharmaceutical composition 
adapted for intravenous administration to human beings. Typically, compositions 
for intravenous administration are solutions in sterile isotonic aqueous buffer. 
Where necessary, the pharmaceutical of the invention may also include a solubi- 

25 lizing agent and a local anesthetic such as lignocaine to ease pain at the site of 
the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water 
free concentrate in a hermetically sealed container such as an ampoule or sa- 
chette indicating the quantity of active agent. Where the pharmaceutical of the in- 

30 vention is to be administered by infusion, it can be dispensed with an infusion botr 
tie containing sterile pharmaceutical grade water or saline. Where the pharma- 
ceutical of the invention is administered by injection, an ampoule of sterile water 
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for injection or saline can be provided so that the ingredients may be mixed prior 
to administration. 

For buccal administration the compositions may take the form of tablets or 
lozenges formulated in conventional manner. 
5 For administration by inhalation, the compounds for use according to the 

present invention are conveniently delivered in the form of an aerosol spray pres- 
entation from pressurized packs or a nebulizer, with the use of a suitable propel- 
lant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoro- 
ethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol 

10 the dosage unit may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of e.g. gelatin for use in an inhaler or insufflator 
may be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. 

The amount of a vector of the invention, which will be effective in the treat- 

15 ment or prevention of the indicated disease, can be determined by standard clini- 
cal techniques. In addition, in vitro assays may optionally be employed to help 
identify optimal dosage ranges. The precise dose to be employed in the. for- 
mulation will also depend on the route of administration, and the stage of indi- 
cated disease, and should be decided according to the judgment of the practitio- 

20 ner and each patient's circumstances. Effective doses may be extrapolated from 
dose-response curves derived from in vitro or animal model test systems. 

The present invention may be better understood by reference to the follow- 
ing non-limiting Examples, which are provided as exemplary of the invention. The 
following examples are presented in order to more fully illustrate the preferred 

25 embodiments of the invention. They should in no way be construed, however, as 
limiting the broad scope of the invention. 

6. EXAMPLES 
6.1. EXAMPLE 1 

30 Cloning and analysis of the expression properties of the vectors su- 

pers and super6wt 
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The vector plasmids super6 (Figure 1) and super6wt were prepared from 
previous generation based gene vaccination vectors VI (Figure 2) and Vlwt, re- 
spectively. Vectors VI and Vlwt are principally synthetic bacterial plasmids that 
contain a transposon Tn903 derived kanamycin resistance marker gene [Oka, A., 

5 et al., J Mol Biol 147 (1981) 217-226] and a modified form of pMB1 replicon [Ya- 
nisch-Perron, C, et al., Gene 33 (1985) 103-119] needed for the propagation in 
Escherichia coli cells. Vectors VI and Vlwt also contain a Cytomegalovirus Imme- 
diately Early Promoter combined with a HSV1 TK leader sequence and rabbit (3- 
globin gene sequences, which both are derived from plasmid pCG (Tanaka, M., 

10 et al., 60 (1990) Cell 375-386]. The latter elements are needed for expressing 
from the nef coding sequence derived from a HAN2 isolate of the HIV-1 strain 
[Sauermann, U., et al., AIDS Research. Hum. Retrov. 6 (1 990) 81 3-823]. The ex- 
pression vectors for the Nef carry clustered ten high affinity E2 binding sites (de- 
rived from plasmid pUC1910BS, unpublished) just upstream of the CMV pro- 

15 moter. 

The parent vector VI contains a modified E2 coding sequence: the hinge 
region of E2 (amino acids 192-31 1 ) is replaced with four glycine-alanine repeats 
from EBNA1 protein of Epstein-Barr Virus [Baer, R. J., et'al., Nature 310 (1984) 
207-21 1]. The protein encoded by this sequence was named as E2d192-31 1+4G. 

20 The parent vector Vlwt contains an expression cassette for wild type E2 protein of 
the bovine papilloma virus type 1 with point mutations introduced for the elimina- 
tion E3 and E4 open reading frame (ORF) coding capacity by two stop codons 
into both these ORFs. In the vectors the E2 coding sequences are cloned be- 
tween a Rous sarcoma virus proviral 5' LTR [Long, E. O., et al., Hum. Immunol. 

25 31 (1 991 ) 229-235] and bovine growth hormone polyadenylation region [Chesnut, 
J. D., et al., J Immunol Methods 193 (1996) 17-27]. 

Plasmid vectors super6 and super6wt were constructed by deleting from 
the respective parent vectors VI and Vlwt all beta-globin sequences downstream 
of the nef gene except the second intron of the rabbit beta-globin gene. The beta- 

30 globin sequences (especially the fragment of the exon) show some homology 
with sequences in the human beta-globin gene, whereas the intron lacks any sig- 
nificant homology to human genomic sequences. The intron was amplified by 
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PCR from the plasmid pCG IJanaka, M. et a!., Cell 60 (1990) 375-386] using oli- 
gonucleotides with some mismatches for modifying the sequences of splicing do- 
nor and acceptor sites of the intron to the perfect match to consensus motifs. The 
Herpes Simplex Virus type 1 thymidine kinase gene polyadenylation region from 
5 pHook [Chesnut, J. D., et al., J Immunol Methods 193 (1996) 17-27] was then 
cloned just next to the 3-end of the intron, because in parent plasmids the rabbit 
|3-globin polyadenylation signal were used. 

The expression properties of the Nef and E2 proteins expressed by the 
plasmid vectors super6 and super6wt were analyzed and compared with the ex- 

10 pression properties of the Nef and E2 proteins expressed by VI and VI wt by Wes- 
tern blotting [Towbin et al., Proc Natl Acad Sci USA 76 (1979) 4350-4354] with 
monoclonal antibodies against Nef and E2. First, Jurkat cells (a human T-cell 
lymphoblast cell line) were transfected by electroporation [Ustav et al. EMBO J 2 
(1991) 449-457] with 1 pg of super6, super6WT or equimolar amounts of; the 

15 plasmids VI, Vlwt. As a control an equimolar amount of vector II (Figure 3), which 
contains an identical Nef cassette but no E2 coding sequence, was used. Carrier 
. DNA was used as a negative control. Briefly, the plasmid and carrier DNA were 
mixed with the cell suspension in a 0.4 cm electroporation cuvette (BioRad Labo- 
ratories, Hercules, USA) followed an electric pulse (200V; 1 mF) using Gene Pul- 

20 ser IITM with capacitance extender (BioRad Laboratories, Hercules, USA). 

Forty-nine hours post-transfection the cells were lysed by treating with a 
sample buffer containing 50mM Tris-HCI pH 6.8; 2% SDS, 0.1% bromophenol 
blue, 100mM dithiothreitol, and 10% (v/v) glycerol. The lysates were run on a 
1 0% or 1 2.5% SDS-polyacrylamide gel and subsequently transferred onto a 0.45 

25 pm PVDF nitrocellulose membrane (Millipore). The membrane was first blocked 
overnight with a blocking solution containing 5% dry milk (fat-free), 0.1 % Tween 
20 in 50 mM Tris-HCI pH 7.5; 150mM NaCI and thereafter incubated for 1h with 
diluted monoclonal anti-Nef antisera (1:100) or anti-E2-antisera (1:1000) in the 
blocking solution. After each incubation step, unbound proteins were removed by 

30 washing strips three times with TBS - 0.1 % Tween-20. The binding of primary 
immunoglobulins was detected by incubating the strips with horse raddish peroxi- 
dase conjugate anti-mouse IgG (Labas, Estonia) followed by visualization using a 
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chemoluminesence detection system (Amersham Pharmacia Biotech, United 
Kingdom). 

The results are shown in Figure 4. The expression of the Nef protein is 
shown on panel A and the expression of E2 protein on panel B. The arrows indi- 

5 cate the right molecular sizes of the Nef and E2 proteins. The expression level of 
the E2d1 92-31 1+4GA is very low and for this reason cannot be seen on the blot 
presented in Figure 4. 

The amounts of Nef expressed from the plasmids super6, super6wt, VI 
and Vlwt (lanes 1-4 in Figure 4A) are quite similar (Figure 4, panel A, lanes 1 to 

10 4). Much less protein is produced from plasmid II (lane 5). The expression levels 
of the Nef protein are higher from vectors containing wtE2 (cf. lane 1 compared 
with lane 2 and lane 3 compared with lane 4). This is in accordance with the ex- 
pression levels of E2 and E2d1 92-31 1 +4GA proteins from these plasmids (Figure 
4, panel B). 

15 

6.2. EXAMPLE 2 

Cloning and analysis of the expression properties of plasmids in se- 
ries productl and NNV 

To increase the copy number of the vectors super 6 and super6wt in Es- 
20 cherichia coli further modifications were made in these vectors. The Tn903 
kanamycin resistance gene, pMB1 replicon and ten E2 binding sites were re- 
moved by Hindlll/Nhel digestion followed by replacing with the Hind lll/Nhel frag- 
ment from retroviral vector pBabe Neo [Morgenstern, J.P. and Land, H., Nucleic 
Acids Research 1 8 (1990) 3587 - 3596]. This fragment contains a modified pMB1 
25 replicon and the Tn5 kanamycin resistance gene that allow relaxed high copy- 
number replication of the plasmids in bacteria. The new plasmids were named as 
the productl (Figure 5), and productl wt respectively. An unsuccessful attempt to 
reinsert the ten E2 binding sites back into the blunted Nhel site upstream of the 
CMV promoter of the productl resulted in vector New Vector NNV, respectively, 
30 with only two binding sites integrated in the plasmid. 

Additional ten E2 binding sites were inserted from plasmid pUC1910BS in- 
to the New Vector in just downstream the E2 expression cassette. These new 
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vectors were named NNV-1 and NNV-2 (Figure 6A). For replacing the E2d192- 
31 1 +4GA with wt E2 (with deleted E3 and E4 ORFs), the E2d1 92-31 1 +4GA cod- 
ing sequence containing Bsp1 20I fragment was replaced with wtE2 containing an 
analogous Bsp120l fragment from the super6wt. Generated plasmids were 

5 named NNV-1 wt and NNV-2wt (Figure 6B), respectively. The numbers 1 or 2 in 
vectors of the NNV series mark the orientation of the 10 E2 binding sites region 
relative to the E2 expression cassette. 

The expression properties of the Nef protein from the NNV plasmids, i.e. 
NNV-1 , NNV-2, NNVwt and NNV-2wt, after the transfection of Jurkat cells by 

10 electroporation at a concentration of 1 ig of the plasmid were analyzed and com- 
pared with the expression properties of the Nef proteins from super6 and su- 
per6wt by Western blotting essentially as described in Example 1 . The amounts 
of super6 and super6wt used for the transfection were 0,95 and 1 1g, respectively. 
The results are shown in Figure 7, 

15 ■ NNV-1 and NNV-2 vectors have expression potential similar to plasmid 

super6 as evident from the comparison of lanes 1 and 2 on figure 8 with lane 5. 
The same applies to vectors NNV-1 wt, NNV-2wt and super6wt (compare lanes 3 
and 4 with lane 6 on figure 7). In accordance with the previous results the plas- 
mids expressing wt E2 produce more Nef protein than E2d 1 92-31 1+4GA vectors 

20 do (compare lane 1 with lane 3 and lane 2 with lane 4 in figure 7). in view of this 
and since the Nef expression from NNV-2wt was slightly higher than that from 
NNV-1 wt, vector NNV-2wt was selected for further tests. 

6.3 EXAMPLE 3 
25 Analysis of the expression properties of NNV-2wt 

To analyse the expression properties of NNV-2wt, four different cell lines, 
i.e. the Jurkat (human T-cell lymphoblasts), P815 (mouse mastocytoma cells), 
CHO (Chinese Hamster Ovary cells) lines and RD (human embryo rhabdomyo- 
sarcoma cells), were transfected by electroporation and analyzed for their ex- 
30 pression of Nef of and E2. To reveal the transcription activation and maintenance 
properties mediated by E2 protein and E2 oligomerized binding sites productl wt, 
which lacks the E2 binding sites (Figure 5), was used as a control. An additional 
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control plasmid was plasmid NNV-2wtFS, which differs from NNV-2wt by contain- 
ing a frameshift introduced into E2 coding sequence, whereby it does not express 
functional E2 protein. 

Each cell line was transfected with different amounts of the vector PNA by 
5 electroporation essentially as described in Example 1. Time-points were taken 
approximately two arid five days after transfection. The results of analyses are 
presented in figures 8 to10. 

The Jurkat cells were transfected with 0.5 pg or 2 pg of the NNV-2wt 
(lanes 1,2, 8, and 9 in Figure 8) and equal amounts of the plasmids NNV-2wtFS 
10 (lanes 3, 4, 10, and 11 in Figure 8) and productlwt (lanes 5, 6, 12, and13 in Fig- 
ure 8) or carrier only (lanes 7 and 14 in Figure 8). Time-points were taken 44 
hours (lanes 1-7) and 1 14 hours (lanes 8-14) aftertransfection: The expression of 
the Nef and E2 proteins was analyzed by Western blotting essentially as de- 
scribed in Example 1. 
15 The P815 cells were transfected with 0.5 pg or 2 pg of the NNV-2wt (lanes 

1 ,2, 8, and 9 in Figure 9) and equal amounts of the plasmids NNV-2wtFS (lanes 
3, 4, 10, and 1 1 in Figure 9) and productlwt (lanes 5, 6, 1 2, and 1 3 in Figure 9) or 
carrier only (lanes 7 and 14 in Figure 9). Time-points were taken 45 hours (lanes 
1-7) and 1 19 hours (lanes 8-14) aftertransfection: The expression of the Nef pro- 
20 teins was analyzed by Western blotting essentially as described in Example 1 . 
The blot with anti-E2 antibodies 1 1 9h post-transfection is not shown, because no 
special signal could be detected. Generally, the expression level of the Nef pro- 
tein correlated with the expression level of E2 protein in these cells, which con- 
firms the fact that the function of the E2 protein is to activate the transcription and 
25 to help the plasmid to be maintained for a longer time in the proliferating cells. 

The CHO cells were transfected with 0.5 pg or 2 pg of the NNV-2wt (lanes 
1 ,2, 8, and 9 in Figure 10) and equal amounts of the plasmids NNV-2wtFS (lanes 
3, 4, 10, and 1 1 in Figure 10) and productlwt (lanes 5, 6, 12, and 13 in Figure 10) 
or carrier only (lanes 7 and 14 in Figure 10). Time-points were taken 48 hours 
30 (lanes 1-7) and 114 hours (lanes 8-14) aftertransfection. The expression of the 
Nef and E2 proteins was analyzed by Western blotting essentially as described in 
Example 1 . 
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The RD cells were transfected with 0.5 pg or 2 pg of the NNV-2wt (lanes 
1 ,2, 8, and 9 in Figure 1 1 ) and equal amounts of the plasmids NNV-2wtFS (lanes 
3,4; 10, and 11 in Figure 11) and productlwt (lanes 5, 6, 12, and 13 in Figure 11) 
or carrier only (lanes 7 and 14 in Figure 11). Time-points were taken 39 hours 
5 (lanes 1-7) and 110 hours (lanes 8-14) after transfection. The expression of the 
Nef protein was analyzed by Western blotting essentially as described in Exam- 
ple! 

In all four cell lines the expression level of the Nef protein, taken at earlier 
time points (lanes 1-7 in figures 8 - 1 1) and at later time points (lanes 8-14 in Fig- 

10 ures 8-1 1 ) hours, from the NNV-2wt was higher than from control vectors. The 
superiority of the NNV-2wt was more obvious at later time-points as evident from 
the comparision of lane 8 with lanes 10 and 12 in Figure 8, and also from the 
comparision of lane 9 with lanes 11 and 13 in figures 8, 9 and 10. 

The expression pattern of RNA from these plasmids was also analyzed us- 

15 ing the Northern analysis [Alwine, J. C, et al., Proc Natl Acad Sci U S A 74 (1977) 
5350-5354] for the NNV-2wt vector. For this purpose, Jurkat and CHO cells were 
transfected with 2 pg of the NNV-2wt. For the transfection of P815 cells 10 pg of 
NNV-2wt were used. The transfections were made essentially as described in 
Example 1 . Forty-eight hours post-transfection total RNA was extracted using 

20 RNAeasy kit (Qiagen) and samples containing 21 pg (P815), 15 pg (CHO) or 10 
pg (Jurkat) of the RNA were analysed by electrophoresis under the denaturing 
conditions (1.3% agarose gel containing 20mM MOPS pH 7.0; 2mM NaOAc; 
1 mM EDTA pH 8.0; 2.2M formaldehyde). The running buffer contained the same 
components except formaldehyde. The samples were loaded in a buffer contain- 

25 ing formamide and formaldehyde. After the electrophoresis the separated RN As 
were blotted onto the HybondN+ membrane (Amersham Pharmacia Biotech, 
United Kingtom) and hybridization with a radio-labeled nef coding sequence, E2 
coding sequence or whole vector probes was carried out. The RNA from cells 
transfected with the carrier was used as a control. The results of the Northern blot 

30 analyses are shown in figure 12. 

The results indicate that no other RNA species than complementary 
mRNAs for E2 and nef are expressed from the vector, since no additional signals 
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can be detected with the whole vector probe compared with net and E2 specific 
hybridizations (compare lanes 1-12 with lanes 13-18 in Figure 12). 

6.4 EXAMPLE 4 

5 Analysis of the attachment of the NNV-2wt to mitotic chromosomes 

The attachment of the NNV-2wt to mitotic chromosomes in CHO cells was 
analyzed by fluoresence in situ hybridisation (FISH) [Tucker J.D., et al., In: 
J.E.Celis (ed.), Cell Biology: A Laboratory Handbook, vol 2, p. 450-458. Academic 
Press, Inc. New York, NY. 1994.]. 

1 o Thirty-six hours post-transfection the CHO cells by electroporation with 1 

ug of NNV-2wt or with equimolar amounts of the control plasmids NNV-2wtFS 
and productlwt (performed essentially as described in Example 1) the cultures 
were treated with colchicin (Gibco) for arresting the cells in metaphase of the mi- 
tosis. Briefly, cells were exposed to colchicine added to medium at final concen- 

1 5 tration of 0.1 ug/ml for 1 -4 h to block the cell cycle at mitosis. Blocked cells were 
harvested by a trypsin treatment and suspended in a 0.075M KCI solution, incu- 
bated at room temperature for 1 5 min, and fixed in ice-cold methanol-glacial ace- 
tic acid (3:1, vol/vol). The spread-out chromosomes at metaphase and nuclei at 
interphase for fluorescence in situ hybridization analyses were prepared by drop- 

20 ping the cell suspension on wet slides. Several slides from one culture were pre- 
pared. 

Hybridization probes were generated by nick-translation, using biotin-1 6- 
dUTP as a label and plasmid Productlwt as template. A typical nick-translation 
reaction mixture contained a nick-translation buffer, unlabeled dNTPs, biotin-1 6- 

25 dUTP, and E.coli DNA polymerase. 

Chromosome preparations were denatured at 70*C in 70% formamide (pH 
7.0-7.3) for 5 min, then immediately dehydrated in a series of washes (70%, 80%, 
and 96% ice-cold ethanol washes for 3 min each), and air-dried. The hybridiza- 
tion mixture (18 pi per slide) was composed of 50% formamide in 2xSSC (1xSSC 

30 is 0.15 M NaCI plus 0.015 M sodium citrate), 10% dextran sulfate, 150 ng of 
biotinylated plasmid probe DNA and 10 pg of herring sperm carrier DNA. After 5 
min of denaturation at70"C, probe DNA was applied to each slide, sealed under 
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a coverslip, and hybridized for overnight at 37°C in a moist chamber. The slides 
were washed with three changes of 2xSSC, nd 2xSSC containing 0.1% IGEPAL 
CA-630 (Sigma Chemical Co.) at45°C. Prior to the immunofluorescence detec- 
tion, slides were preincubated for 5 min in PNM a buffer [PN buffer (25.2 g 

5 Na 2 HP0 4 -7 H 2 0, 083 g NaH 2 P0 4 •_□ H 2 0 and 0.6 ml of IGEPAL CA-630 in 1 li- 
ter of H 2 0] with 5% nonfat dried milk and 0.02% sodium azide). 

Subsequently, the probe was detected with fluorescein isothiocyanate 
(FITC)-conjugated extravidin. The signal was amplified with biotinylated antiavidin 
antibody and a second round of extravidin-FlTC tretment. Between each of the 

10 steps, the slides were washed in PN buffer containing 0.05% IGEPAL CA-630 at 
room temperature for 2x5 min. Chromosomes were counterstained with 
propidium iodide and mounted in p-phenylenediamine antifade mounting me- 
dium. 

Slides were analyzed with a Olympus VANOX-S fluorescence microscope 
15 equipped with appropriate filter set. 

The results are shown in Table 1 . 



TABLE 1. Chromosomal attachment of the NNV-2wt. 



Culture 


Meta- 
phases with epi- 
somal signal on 
chromosomes 


Ana- 
lyzed metaphases 


% 


0.5ug 

NNV-2wt 


11 


158 


7 


0.5ug 
NNV-2wtFS 


0 


100 


0 


0.48pg 
productlwt 


0 


100 


0 


carrier 


0 


100 


0 
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The data indicate clearly that the E2 protein and its binding sites are nee- 
ded forthe chromosomal attachment because only the NNV-2wt but not two other 
vectors have this ability. 

5 6.5 EXAMPLE 5 

Stability of NNV-2wt during propagation in bacterial cells 

The stability of NNV-2wt during propagation in bacterial cells was tested. 
The plasmid NNV-2wt was mixed with competent Escherichia coli cells of the 
DH5alpha strain [prepared as described in Inoue, H., et al., Gene 96 (1990) 23- 
10 28] and incubated on ice for 30 minutes. Subsequently, the cell suspension was 
subjected to a heat-shock for 3 minutes at 37°C followed by a rapid cooling on 
ice. One milliliter of LB medium was added to the sample and the mixture was in- 
cubated for 45 minutes at 37°C with vigorous shaking. Finally, a portion of the 
cells was plated onto dishes containing LB medium with 50 ug/ml of kanamycin. 
15 On the next day, the cells from a single colony were transferred onto the new 
dishes containing the same medium. This procedure was repeated until four gen- 
erations of bacteria had been grown, and the plasmid DNA from the colonies of 
each generation was analyzed. 

One colony from each generation was used for an inoculation of 2 ml LB 
20 medium containing 50 ug/ml of kanamycin followed by an overnight incubation at 
37°C with vigorous shaking. The cells were harvested and the plasmid DNA was 
extracted from the cell using classical lysis by boiling. [Sambrook, S., et al., Mo- 
lecular Cloning A Laboratory Manual. Second ed. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York]. The samples were digested with restric- 
25 tion endonuclease Xbal (Fermentas, Lithuania) and analyzed by agarose gel 
electrophoresis in comparison with the original DNA used for transformation. The 
results are shown in Figure 13. 

As can be seen in Figure 1 3, the vector is stable during the passage in Es- 
cherichia coli cells: no colonies with re-arrangements were observed when com- 
30 pared with the DNA used for transformation (lane 9). 
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6.6 EXAMPLE 6 

Stability of NNV-2wt in eukaryotic ceils 

The stability of the pIasmid.NNV-2wt as a non-replicating episomal ele- 
ment was also analyzed in eukaryotic cells. For this purpose the CHO and Jurkat 
5 cells were tran'sfected with 2 pg of NNV-2wt. Total DNAs of the cells were ex- 
tracted at 24, 72 or 96 hours post-transfection. Briefly, the cells were lyzed in 
20mM Tris-HCI pH 8.0; 10mM EDTA pH 8.0; 100mM NaCI; 0.2% SDS; in pres- 
ence of 200 pg/ml of proteinase K (Fermentas, Lithuania). Next, the samples 
were extracted sequentially with phenol and with chloroform and precipitated with 

10 ethanol. The nucleic acids were resuspended in 10mM Tris-HCI pH 8.0; 1mM 
EDTA pH 8.0;' 20 pg/ml of RNase A (Fermentas, Lithuania) and incubated for 1 
hour at 37°C. Finally the DNA was re-precipitated with ammonium acetate and 
ethanol, washed with 70% ethanol and resuspended in 10mM Tris-HCI pH 8.0; 
1mM EDTA pH 8.0. The samples were digested with different restriction endonu- 

15 cleases: with Eco81 1 (Fermentas, Lithuania) that has two recognition sites on the 
plasmid, with Hindlll (Fermentas, Lithuania) that does not cut the NNV-2wt DNA 
and with Dpnl (New England Biolabs, USA) that digest only DNA synthesized in 
Escherichia coli cells. Restricted DNAs were separated on TAE agarose electro- 
phoresis and analyzed by Southern blotting [Southern, E.M. J. Mol. Biol. 98 

20 (1975) 503-517] with a vector specific radiolabeled probe. The results are illus- 
trated on figure 14. As obvious from comparison of the fragment sizes of Eco81 1 
digestion (lanes 1 , 2 and 7 in figure 14) with respective marker lanes no arrange- 
ments of the vector were detected in the assay. Neither were signals observed at 
a position different from the marker lanes in cases of the Hind III (lanes 3, 4 and 8 

25 in figure 14) or Hindlll/Dpnl (lanes 5, 6 and 9 in figure 14) digestion indicating that 
integration and/or replication events were not observed. 

6.7 EXAMPLE 7 

Analysis of replication of the NNV-2wt in the presence of human 
30 papillomaviral replication factors 

It has been demonstrated previously that papillomaviral proteins are able 
to initiate the replication of heterologous ori-containing plasmidsfrom many other 
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human and animal papillomaviruses [Chiang, C. M., et al., Proc Natl Acad Sci U S 
A 89 (1992) 5799-5803]. Although NNV-2wt does not contain an intact viral origin 
of replication, it was tested how the replication is initiated in the presence of hu- ' 
man papillomavirus type 1 E1 and E2 proteins. CHO cells were transfected with 
one microgram of either plasmids NNV-2wt, NNV-2wtFS or product 1 alone or 
with 4.5 pg of the HPV-1 1 E1 expression- vector pMT/E1 HPV1 1 or with same 
amount of pMT/E1 HPV11 and 4.5 pg HPV-1 1 E2 protein expression vector 
pMT/E2 HPV 1 1 as indicated on the top of the figure 1 5. Transfections were done 
essentially as described in Example 1. E1 and E2 expression vectors are de- 
scribed previously (Chiang, C. M. et al., supra). An equimolar amount of HPV-1 1 
replication origin containing plasmid HPV110RI was transfected with the same 
expression vectors as a positive control. 

Low-molecular weight DNA was extracted by modified Hirt lysis [Ustav, et 
al., EMBO J 2 (1991) 449-457] at 67 hours post-transfection. Briefly, the cells 
washed with PBS were lyzed on ice at 5 minutes by adding alkaline lysis solutions 
I (50mM glucose; 25mM Tris-HCI, pH 8.0; 10mM EDTA, pH 8.0) and II (0.2M 
NaOH; 1 % SDS) in a ratio of 1 :2 onto the dishes. The lysates were neutralized by 
0.5 vol solution III (a mixture of potassium acetate and acetic acid, 3M with re- 
spect to potassium and 5M with respect to acetate). After centrifugation the su- 
pernatant was precipitated with isopropanol, resuspended and incubated at 55°C 
in 20mM Tris-HCI pH 8.0; 10mM EDTA pH 8.0; tOOmM NaCI; 0.2% SDS; in 
presence of 200 pg/ml of proteinase K (Fermentas, Lithuania). Next, the samples 
were extracted sequentially with phenol and with chloroform followed by precipita- 
tion with ethanol. The nucleic acids were resuspended in 10mM Tris-HCI pH 8.0; 
1mM EDTA pH 8.0; 20 pg/ml RNase A (Fermentas, Lithuania) and incubated for 
' 30 min at 65°C. The samples were digested with linearizing endonuclease (Ndel; 
Fermentas, Lithuania) in case of the vectors or Hindlll (Fermentas, Lithuania) in 
case of the HPV110RI) and Dpnl (New England Biolabs, USA) (breaks non- 
replicated DNA), followed by Southern blotting performed essentially as described 
earlier using a vector specific radiolabeled probe. For positive control of hybridiza- 
tion appropriate markers of the linearized vectors and HPV110RI were used (la- 
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(lanes marked as M on figure 15). As seen from the results set forth in figure 15, 
no replication signal was detected in case of any vector plasmids. 

6.8 EXAMPLE 8 

5 Analysis of the E2 and its binding sites dependent segregation func- 

tion of the vectors in dividing cells 

As has been described previously, bovine papillomavirus type 1 E2 protein 
in trans and its multiple binding sites in cis are both necessary and sufficient for 
the chromatin attachment of the episomal genetic elements. The phenomenon is 
10 suggested to provide a mechanism for partitioning viral genome during viral infec- 
' tion in the dividing cells [lives, I., et al., J Virol. 73 (1999) 4404-4412]. Because 
both functional elements are also included into our vector system, the aim of this 
study was analyze the importance of the E2 protein and oligomerized binding si- 
tes for maintenance of the transcriptionally active vector element in population of 
15 dividing cells. 

For this purpose the Nef coding sequence of the vectors NNV-2wt and su- 
per6wtwas replaced with coding sequence of the destabilized form of green fluo- 
rescent protein (d1 EGFP) derived from vector pd1 EGFP-N1 (Clontech Laborato- 
ries). Because the half-life of this protein is as short as 1 hour, it does not accu- 

20 mulate in the cells and the d1 EGFP expression detected by flow cytometer corre- 
lates with the presence of transcriptionally active vector in these cells. 

From NNV-2wt the nef coding sequence was removed and Smal-Noti 
fragment from the pd1EGFP-N1 was inserted instead of it. New vector was 
named as 2wtd1EGFP (Figure 16). Similar replacement was made in case of su- 

25 per6wt for generation gf10bse2 (Figure 17), respectively. The recognition se- 
quence for restrictional endonuclease Spel was introduced into the EcoRI site in 
the super6wt just upstrem the ten E2BS. The vector gf 10bse2 is derived from this 
plasmid by replacing the Nef coding sequence containing Ndel-Bst1 1071 frag- 
ment with d1 EGFP coding sequence containing fragment from 2wtd1 EGFP, cut 

30 out with same enzymes. 

Negative control plasmids lacking either functional E2 coding sequence or 
its binding.sites were also made: The frameshift was introduced into the E2 cod- 
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ing sequence in context of the 2wtd1 EGFP by replacing E2 coding sequence con- 
taining Bsp120l-Bsp120l with similar fragment from plasmid NNV-2wtFS.The re- 
sulting vector was named as2wtd1EGFPFS (Figure 18). For the construction the 
control plasmid NNVdl EGFP (Figure 19) the whole E2 expression cartridge (as 
5 well bacterial replicon) from the 2wtd1 EGFP was removed by Bst1 1 07 and Nhel 
digestion. The replicon was reconstituted from plasmid productl as Hindlll (filled 
in)-Nhel fragment. 

Jurkat cells were transfected by electroporation with 1ug of the vector 
2wtd1EGFP or with equimolar amounts of the plasmids 2wtd1EGFPFS, 

10 NNVdl EGFP, gf10bse2 or with carrier DNA only as described in Example 1. At 
different time-points post-transfection the equal aliquots of the cell suspension 
were collected for analysis and the samples were diluted thereafter with the fresh 
medium. At every time-point total number of the cells as well the number of the 
dIEGFP expressing cells were counted by flow cytometer (Becton-Dickinson 

15 FACSCalibur System). With these data, the percentages of d1 EGFP expressing 
cells, alterations of total numbers of cells and numbers of dIEGFP expressing 
cells in samples were calculated using the carrier-only transfected cells as a ne- 
gative control for background fluorescence. The calculations of cell numbers were 
done in consideration of the dilutions made. Finally, the error values were calcula- 

20 ted based on technical data of the cytometer about fluctuations of speed of the 
flow. 

Two independent experiments were done. First, the maintenance of 
dIEGFP expressed from the plasmids 2wtd1EGFP, 2wtd1EGFPFS and 
NNVdl EGFP were analyzed during the eight days post-transfection. In the sec- 

25 ond experiment the maintenance of dIEGFP expressed from the plasmids 
2wtd1EGFP, 2wtd1EGFPFS and gf10bse2 were analyzed during the thirteen 
days post-transfection. 

As is obvious from Figures 20 and 21, there was no difference of the 
growth speed of the cells transfected with any vector or carrier only. It means 

30 that differences in the d1 EGFP expression maintenance are not caused by influ- 
ences of transfected vectors themselves on the dividing of the cells. Also, during 
the assay the logarithmic growth of the cells were detected, except the period un- 
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til second time-point in the experiment represented in Figure 20. This lag period 
of the growth is probably caused by the electroporation shock of the cells, be- 
cause the first time-point was taken already 1 9 hours after the transfection. 

As illustrated in Figures 22 and 23, the percentages of green fluorescent 
protein expressing cells decrease in all populations transfected with either plas- 
mid, because the vectors do not replicate in the cells. However, as is seen on the 
charts, the fraction of positive cells declines more rapidly in cases of control vec- 
tors, if compared with the 2wtd1EGFP or gf10bse2. If compared with each other, 
the gf1 0bse2 have clear benefit to 2wtd1 EGFP (Figure 23.). There is also a nota- 
ble difference of maintenance between control plasmids 2wtFSd1EGFP and 
NNVdIEGFP (Figure 22). 

These differences between the vectors become much more obvious, if the 
data are represented as alterations of the numbers of the dIEGFP expressing 
cells in the populations (Figures 24 and 25). The numbers of the positive cells in 
cases of the control plasmids are not notable changed during the assay. In con- 
trast, in case of the 2wtd1EGFP the number of dIEGFP expressing cells in- 
creases during the first week after the transfection becoming approximately five 
to ten times higher than in control samples (Figure 24). After this time-point the 
number start to decrease (Figure 25). The difference of maintenance is strongest 
in the case of the gf10bse2 vector. The number of positive cells increases con- 
tinuously during the analyses period. After two weeks it is 6 times higher than in 
the sample transfected with 2wtd1 EGFP and 45 times higher than in the popula- 
tion transfected with frameshift mutant (Figure 25). 

The data demonstrate clearly that the vector system of the present inven- 
tion has active mechanism of segregation based on a nuclear-anchoring protein, 
i.e. bovine papillomavirus type 1 E2 protein and its binding sites that promotes its 
maintenance in a population of proliferating cells as a transcriptionally active ele- 
ment. 

6.9 EXAMPLE 9: 

CLONING OF THE AIRE GENE INTO SUPER6WT AND EXPRESSION 
IN AN EPITHELIAL CELL LINE 
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The AIRE gene coding for the AIRE protein (AIRE = autoimmune regula- 
tor) is mutated in an autosomally heredited syndrome APECED (Autoimmune po- 
lyendocrinopathy candidiasis ectodermal dystrophy). AIRE is expressed in rare 
epithelial cells in the medulla of thymus and in the dendritic cells in peripheral 
5 blood and in peripheral lymphoid organs. APECED could thus be treated by 
transferring the non-mutated AIRE gene ex vivo to peripheral blood dendritic 
cells, followed by the introduction of the corrected dendritic cells back to the pa- 
tient. To test this possibility human AIRE gene and the homologous murine AIRE 
gene were transferred to COS-1 cells. 
10 ' For cloning of the AIRE gene into Super6wt a maxi-preparation of the vec- 

tor was prepared. First a transfection with Super6wt was done to TOP1 0 -cells 
(chemically competent Escherichia coli by Invitrogen) according to manufacturer's 
protocol. Briefly, the cells were incubated on ice for 30 minutes, after which a 
heat shock was performed in a water bath at +42°C for 30 seconds. The cells 
1 5 were then transferred directly on ice for 2 minutes and grown in 250 pi of SOC- 
medium (2% tryptone, 0.5% yeast extract, 10 mM NaCI, 2.5 mM KCI, 10 mM 
MgCI2, 10 mM MgS04, 20 mM glucose) at +37°C with shaking for 1 hour. 

Plating was done on LB-plates using kanamycin (50 ug/ml) for selection. 
For maxiprep, colonies were transferred to 150 ml of LB-solution containing ka- 
20 namycin (50 ug/ml) and grown overnight at +37°C with shaking. Preparation of 
maxiprep was done using Qiagen's Plasmid Maxi Kit according to manufacturer's 
protocol. 

A digestion with BamHI and Sail restriction enzymes was used to check 
the vector. The reaction mixture contained 500 ng of Super6w, 5 U of BamHI, 5 U 
25 of Sail, 2 pi of 2XTANGO buffer (both the restriction enzymes and buffer from 
Fermentas) and sterile water in total volume of 10 pi. The digestion was carried 
out at +37*C for one hour. 

The digested vector was checked with 1 % agarose gel containing ethidium 

bromide 1 pg/ml in 1X TAE-buffer. 
30 For cloning of the PCR amplified AIRE gene and Aire fragments into the 

Super6wt, 4 pg of Super6wt was digested with 1 0 U Natl restriction enzyme (MBI 
Fermentas, in 2 pi enzyme buffer and sterile water added to a final volume of 20 
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pi. The digestion was carried at +37°C for 1.5 hours, after which 1U of ZIP- 
enzyme (alkaline phosphatase) was added to the reaction mixture and incubated 
further for 30 minutes. The ZIP-enzyme treatment was done to facilitate the inser- 
tion of the AIRE gene into the vector by preventing the self-Iigation of the vector 
5 back to a circular mode. After the digestion the vector was purified using GFXTM 
PCR DNA and Gel Band Purification Kit (Amersham Pharmacia Biotech) and dis- 
solved in to a concentration of 0.2 micrograms/microliter. 

Human and mouse AIRE-gene PCR-products were also digested with Notl 
restriction enzyme. To the digestion, 26 pi of PCR product, 3 pi of an appropriate 
10 enzyme buffer and 1 0U of Notl restriction enzyme (the buffer and enzyme from 
MBI Fermentas) was used. The digestion was carried out at +37°C for 2 hours, 
after which digested PCR-products were purified and dissolved in sterile water to 
a volume of 10 pi. 

The PCR amplified and digested human and mouse AIRE genes were li- 
15 gated to Super6wt by a T4 DNA ligase (MBI Fermentas). The digested insert 
DNA was taken (a total volume of 1 0 pi), 1 .5 pi of ligase buffer (MBI Fermentas), 
5U of T4 DNA ligase and sterile water was added to a final concentration of 15 pi. 
The ligation was carried out at +17*C overnight. 

After the ligation 10 pi of ligation reaction mixture was taken for transfec- 
20 tion into TOP10 cells according to manufacturer's protocol. The cells were incu- 
bated on ice for 30 minutes, after which a heat shock was performed in a water 
bath at +42 6 C for 30 seconds. The cells were then transferred directly on ice for 
2 minutes and grown in 250 pi of SOC-medium (2% tryptone, 0.5% yeast extract, 
10 mM NaCI, 2.5 mM KCI, 10 mM MgCl 2 , 10 mM MgS0 4 , 20 mM glucose) at 
25 +37*C with shaking for 1 hour. 

The transfected bacterial cells were plated onto LB-kanamycin plates and 
colonies were picked on the following day to 2 ml of LB-medium (1% tryptone, 
0.5% yeast extract, 1 70 mM NaCI) with kanamycin and grown overnight at +37 W C. 
Miniprep DNA preparations from selected colonies were purified using 
30 Qiagen's Plasmid Midi Kit and dissolved to a volume of 50 pi of sterile water. The 
presence and size of the insert was checked with Notl and BamHI digestion. 1 0 pi 
of miniprep DNA was taken for digestion, 5U of Notl and 5U of BamHI enzymes, 
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2 ml of R+ enzyme buffer and sterile water was added to a final volume of 20 pi. 
The digestion was carried out at +37*C for 1 hour. 

The orientation of the insert was analysed with BamHI restriction enzyme. 
Ten pi of minprep DNA was taken, 5 U of BamHI, 2 pi of BamHI buffer (MBI Fer- 
5 . mentas) and sterile water was added to a final volume of 20 pi. The digestion was 
carried out for 1 hour at +37°C and the products were checked on a1 % agarose 
gel with EtBr in 1XTAE. 

On the basis on these results, a plasmid containing a mouse AIRE-gene 
and a plasmid containing a human AIRE-gene were picked and maxipreps were 
10 prepared. Briefly, 0.5 ml of E. coli cell suspension containing the plasmid of inter- 
est or a miniprep culture was added to a 150 ml LB-medium containing kanamy- 
cin (50 pg/ml) and grown overnight at +37*C. Maxiprep DNAs were prepared us- 
ing Qiagen's Plasmid Maxi Kit. 

The plasmid containing the mouse AIRE-gene was designated as 
15 pS6wtmAIRE and plasmid containing the human AIRE-gene as pS6wthAIRE. 

The generated vectors were sequenced for approximately 500 bp from 
both ends to verify the orientation and correctedness of the insert. The sequenc- 
ing was performed using the dideoxy method with PE Biosystem's Big Dye Ter- 
minator RR-mix, which contains the four different terminating dideoxynucletide 
20 triphosphates labeled with different fluorescent labels. 

Plasmids containing the AIRE gene and AIRE gene fragments were in- 
serted into selected cell lines to check the expression of the protein with Western 
blot after the transfection. 

Cos-1 cells were harvested with trypsin-EDTA (Bio Whittaker Europe) 
25 solution and suspended 1 0x1 06 cells/ml into Dulbecco's MEM (Life Technologies) 
medium and 250 pi of cell suspension was taken for transfection. The transfec- 
tion of Cos-1 cells was performed using electroporation with 2.5x1 06 cells, 50 pg 
of salmon sperm DNA as a carrier and 5 pg of appropriate vector, the transfec- 
tions were made with pS6wthAIRE, pS6wtmAIRE, Super6wt, pCAIRE, psiAIRE 
30 and pCAIRE S1-4. pCAIRE and psiAIRE are positive human AIRE controls, 
pCAIRE S1-4 is a positive mouse AIRE control and Super6wt is a negative con- 
trol. 
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The electroporation was done using Biorad's Gene Pulser with capaci- 
tance 960 pFd, 240 V and 1 pulse. After the pulse the cells were kept at room 
temperature for 10 minutes and 400 pi of medium was added. The cells were 
transferred to 5 ml of medium and centrifuged for 5 minutes with 1 000 rpm. Cells 
5 were plated and grown for 3 days at +37*C, 5% C02. 

The ceils were harvested with trypsin-EDTA and centrifuged. Then Cells 
were then washed once with 500 ml of 1XPBS (0.14 mM NaCI, 2.7 mM KCI, 7.9 
pM Na2HP04, 1 .5 pM KH2P04). 50 pi of PBS and 1 00 pi of SDS loading buffer 
(5% mercaptoethanol, 16 pM Bromphenolblue, 20 pM Xylene Cyanol, 1.6 mM Fi- 

10 coll 400) was added and cells were heated at +95°C for 10 minutes. 

For the western blot analysis SDS-PAGE was prepared with 10% separa- 
tion and 5% stacking gels in a SDS running buffer (25 mM Tris, 250 mM glysin, 
0.1% SDS). Cell samples and biotinylated molecular weigh marker were loaded 
on the gel and electrophoresis was performed with 150 V for 1h 50 minutes. The 

1 5 transfer of proteins to a nitrocellulose membrane was performed at 1 00 V for 1 .5 
hours at room temperature with a cooler in transfer buffer. 

The membrane was blocked in 5% milk in TBS (0.05 M Tris-CI, 0.1:5 M 
NaCI, pH 7.5) for 30 minutes at room temperature. A primary antibody mixture, 
anti-AIRE6.1 (human) and anti-AIRE8.1 (mouse) antibodies at a dilution of 1 :100 

20 in 5% milk in TBS, was added onto membrane and incubated overnight at +4°C. 
The membrane was washed two times with 0.1% Tween in TBS for 5 minutes 
and once with TBS for 5 minutes. The secondary antibody, biotinylated anti- 
mouse IgG at a dilution of 1:500 in 5% milk in TBS was incubated for 1 hour at 
room temperature. The membrane was washed and horseradish peroxidase avi- 

25 din D at a dilution of 1:1000 in 5% milk in TBS was added. The membrane was 
incubated at room temperature for 1 hour and washed. A substrate for the per- 
oxidase was prepared of 5 ml chloronaphtol, 20 ml TBS and 10 pi hydrogen per- 
oxide and added onto membrane. After the development of the color the mem- 
brane was washed with TBS and dried. 

30 The antibody detecting with human AIRE (anti-AIRE6. 1 ) detected the AI- 

RE protein expression in the preparates transfected with pS6wthAIRE, pCAIRE 
and psiAIRE. The antibody detecting murine AIRE detected likewise the murine 
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AIRE in cells transfected with pS6wtmAIRE and pCAIRE S1-4. The negative con- 
trol (Super6wt) showed no AIRE/aire proteins. 

6.10 EXAMPLE 10: DETECTION OF CELLULAR AND HUMORAL 
5 IMMUNE RESPONSE TOWARD HIV.1 NEF IN MICE IMMUNIZED 

WITH THE NNV-NEF CONSTRUCT 

DNA IMMUNIZATIONS 

To further study the induction of humoral immunity by the vectors of the in- 
ventions, 5-8 weeks old both male and female BALB/c (H-2d) mice were used. 

10 For the DNA immunizations, the mice were anaesthetized with 1 ,2 mg of pento- 
barbital (i.p) and DNA was inoculated on shaved abdominal skin using plasmid 
DNA coated gold particles. The inoculation was made with Helios Gene Gun (Bio- 
Rad) using the pressure of 300 psi. The gold particles were 1 pm in diameter, ~1 
pg of DNA/cartridge.The mice were immunized twice (on day 0 and day 7) with a 

1 5 total amount of DNA of 0,4 or 8 pg/mouse. The control mice were immunized with 
8 pg of the plain vector without the nef^gene, i.e.NNV-deltanef. 

A blood sample was taken from the tail of the mice two weeks after the last 
immunization. The mice were sacrificed four weeks after the last immunization 
and blood samples (100 pi) were collected to Eppendorf tubes containing 10 pi of 

20 0,5 M blotting (++ vs. +) and in ELISA (higher OD t more mice in higher-dose 
above cut-off EDTA. The absolute number of leukocytes/ml of blood was calcu- 
lated from these samples for each mouse. The sera were collected for antibody 
assays and stored at -20°C. The spleens were removed aseptically, weighted 
and then homogenized to single cell suspensions for use in T, B and NK cell as- 

25 says and staining. 

Detection of the humoral immunogenicity of the vectors of the inven- 
tion 

For the detection of Nef-specific antibodies by Western blotting, serum 
samples from mice immunized with the vector constructs of the invention were di- 
30 luted 1 :1 00 to 5% milk in TBS and applied on nitrocellulose strips made with re- 
combinant HIV-1 Nef protein. For the preparation of the nitrocellulose strips, the 
purified recombinant protein was boiled in a sample buffer containing 1% SDS 
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and 1% 2-mercaptoethanol, then run on a 10 or 12.5 % polyacrylamide gel and 
subsequently transferred onto a 0.45 pm nitrocellulose paper. The strips were 
first blocked with 2% BSA in 5% defatted milk-TBS and thereafter incubated with 
diluted sera (1 :1 00) overnight. After incubation, unbound proteins were removed 
5 by washing the strips three times with TBS - 0.05% Tween-20 and twice with wa- 
ter. After washings, the strips were probed with a 1:500 dilution of biotinylated 
anti-mouse IgG (Vector Laboratories, USA) for 2 hour. After further washings, 
horseradish peroxidase-avidin in a dilution of 1 :1000 (Vector Laboratories, USA) 
was added for 1 h, the strips were washed again and the bound antibodies were 

1 o detected with a hydrogen peroxidase substrate, 4-chIoro-1 -naphtol (Sigma, USA). 

The sera were also tested in ELISA to determine the exact antibody titers 
induced by each construct. Nef antibody ELISA was performed as previously de- 
scribed (Tahtinen et al., 2001 ). Briefly, Nunc Maxi Sorp plates were coated with 
50 ng of Nef (isolate HAN), blocked with 2% BSA in phosphate buffered saline 

15 (PBS), and the sera in a dilution of 1:100 to 1:25000 were added in duplicate 
wells for an overnight incubation. After extensive washings, the secondary anti- 
body, peroxidase conjugated anti-mouse IgG or IgM (DAKO), was added, and the 
plates were incubated for two hours and then washed. Color intensity produced 
from the substrate (2,2'-azino-bis(3-ethylbenzthiazoline-6-sulfonic acid, ABTS, 

20 Sigma) in a phosphate-citrate buffer was measured at 405 nm using a Labsys- 
tems Multiscan Plus ELISA-plate reader. The optical density cut-off value for 
positive antibody reactions was determined as follows: 
cut-off = OD (xl control mice sera) + 3 SD. 

Detection of the cellular immunogenjcity of the vectors of the invention 
25 To analyze the capacity of the vectors of the invention to induce cellular 

immunity, T-cell and B-cell assays as well as cell surface staining were per- 
formed. 

T cell proliferation assay. The spleen cells were suspended to a final 
concentration of 1 x 106/ml RPMI-1640 (GibcoBRL) supplemented with 10% FCS 
30 (GibcoBRL), 1% penicillin-streptomycin (GibcoBRL) and 50 pM beta- 
mercaptoethanol (Sigma). Cells were incubated in microtitre plates at 200 pl/well 
with media only or with different stimuli. The final concentrations of stimuli were: 
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Con A 5 pg/ml, HIV-Nef-protein at a concentration of 1 and 10 pg/ml, and a nega- 
tive control antigen HIV-gag at a concentration of 1 and 10 |jg/ml. All reactions 
were made in quadruplicates. On the sixth day of the incubation 100 pi of 
supernatant from each well was collected and stored at -80°C for cytokine 

5 assays. Six hours before harvesting 1 pCi of 3H-thymidine (Amersham 
Pharmacia Biotech) was added to each well. The cells were harvested and 
radioactivity incorporated (cpm) was measured in a scintillation counter. The 
stimulation indexes (SI) were calculated as follows 

SI = mean experimental cpm/ mean media cpm. 

io Lymphocyte activation. T and B cell activation was detected by double sur- 

face staining of fresh splenocytes with anti-CD3-FITC plus anti-CD69-PE (early 
activation marker) and anti-CD19-FITC plus anti-CD69-PE antibodies (all from 
Pharmingen). Stainings were analyzed with flow cytometer (FACScan, Becton 
Dickinson). 

15 CTL assays. Mouse splenocytes were co-cultured with fixed antigen pre- 

senting cells (P-81 5 cells infected with MVA-HIV-nef or control MVA-F6) for five 
days after which they were tested in a standard 4 hour 51 chromium release as- 
say [Hiserodt, J., etal.,.J Immunol 135(1995)53-59; Lagranderie, M.,etal., J Vi- 
rol 71 (1 997) 2303-2309) against MVA-HIV-nef infected or control target cells. In 

20 CTL assays the specific lysis of 10% or more was considered positive. 

Cytokine assay. IFN-gamma and IL-10 were measured from antigen- 
stimulated cell culture supematants in order to analyze, whether immunized mice 
develop a Th1 type orTh2 response. The supematants were collected from anti- 
gen-stimulated cells as described above. Pro-inflammatory cytokines TNF-alfa 

25 and IL-10 were measured in the sera of the immunized mice. All cytokines were 
measured with commercial ELISA kits (Quantikine, R&D Systems). 

Spontaneous proliferation. Spontaneous splenocyte proliferation was 
detected by 3H-thymidine uptake of the cells cultured in the medium only for 6 
days.- 

30 Anti-double strand (ds) DNA antibodies. dsDNA antibodies were meas- 

ured in the sera of immunized mice, positive control mice (mrl/lpr, a generous gift 
from Dr. Gene Shearer, NIH, USA) and normal mice. The antibodies were as- 



WO 02/0905581 



PCT/FI02/00379 



67 

sayed with ELISA on poly-L-Lysine bounded lambda phage dsDNA. The results 
are shown in Tables 2 and 3. 

Table 2 shows complete immunological results of the mice immunized with 
HIV-Nef plasmid DNA. Although HIV-1 Nef recombinant protein, which was used 
5 for in vitro T cell stimulation, induced some non-HIV-specific proliferation of the 
cells in each immunized group, there was a significant increase in the mean SI of 
mice immunized with 0,4 (jg of the plasmid (mean Sl=72,2) compared to others. 
Furthermore, negative control protein HIV-gag did not induce any T cell response. 
Only the T cells of the mice in the group that had nef-specific proliferation also 

10 produced nef-specific IFN-gamma. None of the immunized mice had cells 
producing IL-10, which shows that the T cell response in the immunized mice was 
of Th1 type and not of Th2 type. In contrast to the T cell response, mice 
immunized with the higher concentration of nef plasmid DNA (8 pg) had a 
stronger B cell response compared to mice immunized with 0,4 pg: the humoral 

15 • response in mice immunized with the higher dose was detectable already three 
weeks after the last immunization and the response detected was stronger both 
in Western-). The antibodies detected belonged to IgG-class, no IgM response 
was detected. None of the mice developed E2 specific antibody. 

The mice immunized with 0.4 pg of HIV-nef plasmid DNA had an in- 

20 creased number of leukocytes (6.38x1 06/ml) in the peripheral blood compared to 
other groups of immunized mice and normal mice (3.8x106/ml) (Table 3). The 
same mice had twice as much activated T cells (21 %, CD3+CD69+) compared to 
other mice (9% and 1 0%). This finding is in correlation with the positive T cell re- 
sponse to HIV-Nef (Table 2), since the mice with a positive T cell response to Nef 

25 also had an increased number of activated T cells in their spleens. The results of 
Table 3 also show that none of the immunized mice developed anti-dsDNA anti- 
bodies as compared to positive control sera (OD=1,208) indicating that there is 
no adverse effect of the immunization. 
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Mic e WBC CD3 CD3+CD69+ CD19 CD19+CD69+ anti-dsDNA 
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6.11. EXAMPLE 11: SAFETY AND IMMUNOGENICS OF 
A PROTOTYPE HIV VACCINE GTU-NEF IN HIV INFECTED PA- 
TIENTS 

5 Production of the NNV-2-Nef vaccine (CHECK whether NNV-2 

or NNVwt-2 was used) 

The investigational vaccine NNV-2-Nef was prepared according to 
Example 2 with the Manufacturing License No. LLDnro 756/30/2000 (issued by 
the Finnish National Agency for Medicines on 21 .1 2.2000). 
10 The manufacturing processes performed fulfilled the current Good 

' Manufacturing Practices (cGMP) requirements and provided plasmid DNA 
preparations suitable for use in clinical phase I and II studies. The manufactur- 
ing process consisted of four steps: 

a) Establishment of Master Cell Banks and Working Cell Banks , 
15 b) Fermentation 

c) Purification 

d) Aseptic filling of the vaccine 

In detail, NNV-2-Nef was produced in £ coli bacteria. The Master 
Cell Banks (MCBs) and Working Cell Banks (WCBs) containing E. coli DH5 al- 
20 pha T1 phage resistant cell strain were established in accordance with the 
specific Standard Operating Procedure from pure cultured and released Re- 
search Cell Banks. 

a) Establishment of Master Cell Banks and Working Cell 

Banks 

2 5 The schematic procedure for establishing the cell bank system is 

illustrated below: 

Thaw of one vial of Research Cell Bank [E. coli DH5 alpha T1 
phage resistant cell strain (Gibco RBL) transformed with the NNV-2-Nef plas- 
mid. 

30 Inoculate of the culture on modified Luria Bertani medium plate 

(containing 25 ng/ml of kanamycin) 

Incubate overnight (14-1 6h) at 37°C 

Select of a single colony from the plate and inoculation into 50 ml 
of modified Luria Bertani medium (containing 25 ng/ml of kanamycin) 
35 Incubate overnight (14-1 6h) at 37°C 

Measure optical density of the bacterial culture (OD 6 oo= 2.0 - 6.0) 
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Add glycerol to bacterial culture 

Divide the culture-glycerol mix to aliquots 

Label and store the Master Cell Banks 

Following the same diagram, the Working Cell Bank was estab- 
5 lished using one vial of the Master Cell Bank as the starting material. The rou- 
tine tests performed on the MCB and WCB were: microbiological characteriza- 
tion, absence of contamination, assessment of the plasmid stability by replica 
plating and the plasmid identity (restriction enzyme digestion and sequencing). 

b) Fermentation. In the fermentation the DH5 alpha T1 phage 
10 resistant E. coli strain (Gibco RBL, UK) transformed with NNV-2-Nef (WCB) 

was first cultured on plate. From the plate a single colony was inoculated to a 
100 ml liquid pre-culture before the actual fermentation in the fermentation 
reactor. The fermentation was carried out in a 5 I fermentor (B. Braun Medical) 
on a fed-batch system basis, after which cells were harvested. The culture 

15 medium composition for one litre contained 7g of yeast extract, 8g of peptone 
from soy meal, 10g of NaCl, 800ml of water for injection (WFI), 1N NaOH, pH 
7.0, kanamycin 50mg/ml (Sigma), silicon anti-foaming agent (Merck), 1M 
K 2 P0 4 (BioWhittaker). 

in the beginning of the fermentation run, a 1 ml sample was taken 

20 through the harvesting tube to determine the initial cell density (OD 6 oo). The 
pre-culture was used to inoculate the fermentation medium. During the fermen- 
tation, fresh culture medium and 1M potassium phosphate buffer, pH 6.5 - 7.3, 
were fed to the reactor with the pumps. Addition of the medium allows replen- 
ishment of essential nutrients before they run out and phosphate buffer main- 

25 tains the pH constant. When the fermentation process had continued for ap- 
proximately 5 hours and at the end of the fermentation run (after approximately 
10 hours of fermentation), samples of 1 ml were taken as above and the cell 
density was measured. After the fermentation, the culture medium was centri- . 
fuged (10,000 rpm, 30 minutes, +4°C) and the bacterial pellet (50-60g) was re- 

30 covered. 

c) Purification. The methodology used for the purification of DNA 
was based on the QIAGEN process scale technology (Qiagen Plasmid Purifi- 
cation Handbook 11/98). The NN2-Nef was purified using the following steps: 

Resuspend the bacterial pellet in the resuspension buffer (100- 

35 150ml, RT) 

Lyse with the lysis buffer (100-1 50ml, 5 minutes, RT) 



\ 
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Neutralize with the neutralization buffers (100-1 50ml, +4 C) 
Incubate (30minutes, +4 C) 
Centrifugate (10,000 rpm, 30 minutes, +4 C) 
Filtrate supernatant (0.22 micrometers) 
Remove endotoxins with Endotoxin removal buffer (60-90ml) 
Equilibrate Ultrapure column with Equilibration buffer (350ml, flow 
rate 10ml/min) 

Load lysate to the column (flow rate 4-6ml/min) 
Wash the column with Wash buffer (31, overnight, flow rate 4- 
10 6ml/min) 

Elute the plasmid DNA with Elution buffer (400ml, flow rate 

3.1ml/min) 

Filtrate the eluate (0.22 micrometer) 
Precipitate DNA with isopropanol 
1 5 Centrifuge (20000g, 30 minutes, 4 C) 

Purified plasmid DNA 



Buffers used within the purification were as follows. The resuspen- 
sion buffer contained 50mM Tris-CI, pH 8.0, plus RNase A (50mg); the lysis 
20 buffer was 200mM NaOH; the neutralization buffer was 3M potassium acetate, 
pH 5.5; the endotoxin removal buffer contained 750mM NaCI, 10% Triton X- 
100; 50mM MOPS, pH7.0; the equilibration buffer contained 750mM NaCI, 
50mM MOPS, pH 7.0; the wash buffer contained 1 M NaCI, 50mM MOPS, pH 
7.0, 15% isopropanol; and elution buffer contained 1.6 M NaCI, 50mM MOPS, 
25 pH 7.0, 15% isopropanol. 

d) Aseptic filling 

The purified DNA representing the final bulk was dissolved in 0,9% 
sterile physiological saline to a final concentration of 1 mg/ml and sterile fil- 
tered (0.22 micrometer) during the same day. The purified bulk was filled 

30 manually (filling volume 0.5 ml) in Schott Type 1 plus glass vials using a steam 
sterilized Finnpipette® and sterile endotoxin-free tips. The vials filled with the 
NN2-Nef vaccine were closed immediately, labelled and packed in accordance 
to the specific Standard Operating Procedure (SOP). 

2. Administration of the test vaccine to the patients 

35 Ten HlV-1 infected patients undergoing Highly Active Anti- 

Retroviral therapy (HAART) were immunized with the experimental DNA vac- 
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cine NN2-Nef, expressing the HIV-1 Nef gene (Clade B). For immunizations, 
two intramuscular injections in the gluteal muscle were given two weeks apart. 
The doses were 1 and 20 micrograms/injection. Blood samples were drawn at 
-4, 0, 1, 2, 4, 8 and 12 weeks. The samples were analyzed for humoral 
(ELISA, Western blot) and cell mediated immune response (T-cell subsets, T- 
cell proliferation, ELISPOT, cytokine expression, intracellular cytokines). 

A clinical examination was performed to each patient participating 
the study. The clinical examination included a patient interview (anamnesis) 
and weight determination. Cardiac and pulmonar functions were checked by 
auscultation and percussion, the blood pressure and heart rate were recorded. 
Enlargement of lymph nodes, liver and thyroid gland were determined by pal- 
pation. 

Laboratory tests to evaluate the safety of the vaccine were per- 
formed at each visit. These tests included: 

Hematology: red blood cell count, haemoglobin, total and differen- 
tial WBC, platelet count, prothrombin time and activated partial thromboplastin 
time at baseline; mean erythrocyte corpuscular volume and hemoglobin con- 
tent has been calculated. 

Immunology : nuclear and ds-DNA antibodies. 

Serum chemistries: total bilirubin, alkaline phosphatase, 
SGOT/SLT or SGPT/ALT, serum creatinine, protein electrophoresis, total se- 
rum cholesterol, triglycerides, glucose (at baseline), sodium, potassium, and 
calcium. 

Urine analysis: dipstick protein, glucose, ketones, occult blood, 
bile pigments, pH, specific gravity and microscopic examination of urinary 
sediment (RBC, WBC, epithelial cells, bacteria, casts), when dipstick determi- 
nation showed one or more abnormal values. 

Viral load: Increases of more than one log 10 should be followed 
by a confirmatory viral load estimate after two weeks. 

None of the patients experienced subjective or objective adverse 
reactions to the vaccination. No adverse laboratory abnormalities were ob- 
served in the panel of clinical chemistry tests (see material and methods for 
details) performed repeatedly during the vaccination period. 

The following immunological studies were performed: 
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Lymphocyte proliferation assay (LP A) 

Peripheral blood mononuclear cells (PBMC) were isolated from 
heparinized venous blood by Ficoll-Hypaque density-gradient (Pharmacia) 
centrifugation and resuspended at 1x10 6 cells/ml in RPMI 1640 medium 

5 (Gibco) supplemented with 5% pooled, heat-inactivated AB + serum (Sigma), 
antibiotics (100 U/ml penicillin and 100 pg/ml streptomycin; Gibco) and L- 
glutamine (complete medium, CM). Quadruplicate cultures were then set up in 
flat-bottomed micro titer plates (1x1 0 s PBMC/well) and the cells were incu- 
bated for 6 days in the presence or absence of the following stimuli: rNef (0.2, 

10 1 and 5 pg/ml), GST (0.2, 1 and 5 pg/ml), purified protein derivative of tubercu- 
lin (PPD, 12.5 pg/ml; Statens Seruminstitut), Candida albicans antigen (20 
pg/ml;- Greer Laboratories) and Phytohaemagglutinin (PHA; 5 pg/ml; Life 
Technologies). For the last 6 h of the incubation period 3 H-thymidine (1 
pCi/well; Amersham) was added to the cultures and the cells were harvested 

15 onto glass fiber filters and incorporated radioactivity was measured in a y- 
counter. Results are expresses as delta cpm (cpm in the presence of antigen- 
cpm without antigen) or as stimulation index (cpm in the presence of antigen/ 

cpm without antigen). 

The results are shown in Figures 26 and 27. None of the vac- 

20 cinees showed significant T-cell proliferative response to the test antigen, HIV- 
1 Nef before the vaccination. In contrast, 2 out of 5 vaccinees in the group that 
had received 1 microgram dose of the test vaccine (patients 1 and 3) (Figure 
26) and 2 out of 5 in the group receiving 20 micrograms of the test vaccine 
(patients 9 and 10) (Figure 27) showed a strong T-cell proliferative response 

25 after the first vaccination. After the second vaccination, one (patient 2) vac- 
cinee responded in the 1-microgram group. 
IFN- y assays 

The type of immune response (Th1/Th2) induced by the vaccine 
was evaluated by measuring interferon-gamma (IFN- y) released in 6 days old 

30 culture supernatant after antigen (rNef, rGST, PPD) or mitogen (PHA) stimula- 
tion of PBMC. For determinations, commercial ELISA kits (R&D Quantikine) 
were used. The assay employ the quantitative sandwich enzyme immunoas- 
say technique where a monoclonal antibody specific for IFN- y has been 
coated onto a microplate. Standards and samples are pipetted into the wells 

35 and any IFN- y present is bound by the immobilized antibody. After washing 
away any unbound substances, an enzyme-linked polyclonal antibody specific 
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for IFN- y is added to the wells. Following a wash to remove any unbound an- 
tibody-enzyme reagent, a substrate solution is added to the wells and color 
develops in proportion to the amount of the cytokine bound in the initial step. 
The color development is stopped and the intensity of the color is measured. 
5 IFN-Y response data from patient # 1 is shown in Figure 28. As can 

be seen, the vaccinee responded to the rNef antigen by marked IFN-y re- 
sponse correlated with the T-cell proliferation, indicating that the response 
seen in the vaccinee is in fact of the Th1 type. 

HIV-1 infection is characterized by low or totally lacking cell-mediated 

10 immune response towards all HIV proteins. The results show that it is possible 
to induce a robust CM I in such patients with exceptionally low doses of the 
DNA vaccine NN2-Nef. The doses used were minimal to what has generally 
been required with DNA vaccines. Thus, for instance, Merch announced re- 
cently good results with their experimental HIV vaccine but the doses required 

1 5 were from 1 000 to 5000 micrograms (I AVI report, 2002). 

6.12 EXAMPLE 12: CONSTRUCTION OF THE PLASMID EXPRESS- 
ING EPSTEIN-BARR VIRUS (EBV) EBNA-1 PROTEIN AND CONTAINING 20 
BINDING SITES FOR EBNA-1 (FR ELEMENT) 

20 To construct a plasmid expressing Epstein-Barr virus (EBV) EBNA-1 

protein and containing 20 binding sites for EBNA-1 (FR element), BPV-1 E2 
binding sites were first replaced by EBV EBNA-1 binding sites (oriP without DS 
element). Plasmid FRE2d1EGFP (Figure 29) was constructed by isolating the 
Xr77/l(Accl)/Eco32l(EcoRV) DNA fragment (blunt-ended with Klenow enzyme) 

25 of pEBO LPP plasmid (Figure 29A) (the fragment contains 20 binding sites for 
EBNA-1) and inserting it by blunt end ligation into the Spel/Nhel site of 
s6E2d1EGFP (Figure 29B) (blunt-ended with Klenow enzyme). The con- 
structed plasmid FRE2d1EGFP (Figure 29) was used as a negative control in 
further experiments. It contains binding sites for EBNA-1 protein instead of the 

30 BPV1 E2 10 binding sites, expressing E2, but not EBNA-1 . 

Next, the sequence encoding BPV-1 E2 protein in FRE2d1EGFP plas- 
mid was replaced by a sequence encoding EBV EBNA-1 protein as follows. 
The Xmil(Accl)/EcoRI fragment of pEBO LPP plasmid was isolated and blunt- 
ended with Klenow enzyme and inserted into the Xbal/Xbal site of 
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FRE2d1EGFP plasmid (blunted* with Klenow enzyme). The vector 
FRE2d1EGFP was previously grown in Escherichia coli strain DH5a lacking . 
Dam - methylation, because one Xbal site is sensitive for methylation. The 
constructed plasmid FREBNAdlEGFP (Figure 30) expresses EBNA-1 protein 
5 , and contains 20 binding sites for EBNA-1 . 

For expression, Jurkat, human embryonic kidney cell line 293 (ATCC 
CRL 1573) and mouse fibroblast cell line 3T6 cells (ATCC CCL 96) were 
maintained in Iscove's modified Dulbecco's medium (IMDM) supplemented 
with 10% fetal calf serum (FCS). Four million cells (Jurkat), 75% confluent 
10 dishes (293) or 1 / 4 of 75% confluent dishes (3T6) were used for each transfec- 
tion, which were earned out by electroporation as follows. Cells were har- 
vested by centrifugation (1000 rpm, 5 min, at 20 °C, Jouan CR 422), and re- 
suspended in a complete medium containing 5mM Na-BES buffer (pH 7.5). 
250 ul cell of the cell suspension was mixed with 50 ug of carrier DNA (salmon 
15 sperm DNA) and 1 ug (in the case of Jurkat and 3T6) or 5 ug (in the case of 
293) of plasmid DNA and electroporated at 200 V and 1000 uF for Jurkat cells, 
170 V and 950 uF for 293 cells and 230 V and 975 uF for 3T6 cells. The trans- 
fected Jurkat cells were plated on 6-cm dishes with 5 ml of medium; 1/3 of 
transfected 293 and 3T6 cells were plated on a 6-cm dishes with 5 ml of me- 
20 dium and 2/3 of the cells were plated on a 1 0-cm dishes with 1 0 ml of medium. 

The transfected cells were analysed for the expression of dIEGFP pro- 
tein (modified enhanced green fluorescent protein). All of the constructed 
plasmids expressed dIEGFP protein, which was detected by measuring the 
fluorescence using a flow cytometer. Because of the short half-life of the 
25 d1 EGFP protein, it does not accumulate, and the expression of this protein re- 
flects the presence of transcriptionally active plasmids in the cells. Becton- 
Dickinson FACSCalibur system was used. The volume of the Jurkat cell sus- 
pension was measured before each time-point (approximately after every 24 
hour) and if the volume was less than 5 ml, the missing volume of medium was 
30 added. Depending on the cell suspension density the appropriate volume was. 
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taken for measuring (1 or 2 ml) and replaced with the same amount of me- 
dium. This was later taken into consideration when the dilution was calculated. 

For the first time-point, 293 cells from the 6-cm dish were suspended in 
5 ml of medium for measuring. In every following time-point half of the cells 
5 were taken from the 10-cm dish, suspended in 5 ml of medium and then 
measured. An appropriate volume was added to the rest of the cell suspen- 
sion. For the first time-point, 3T6 cells from the 6-cm dish were suspended in 1 
ml of trypsine, which was then inactivated with 100 \x\ of FCS. For every follow- 
ing time-point, cells from the 10-cm dish were suspended in 2 ml of trypsin. 1 

10 ml of this suspension was treated as described previously. 9 ml of medium 
was added to the rest of the suspension. The analyzed cells were taken out of 
the incubator immediately before the measurement. The appropriate flow 
speed (500-1000 cells/sec) was determined before each time-point using cells 
transfected only with carrier DNA as a control. Three different parameters were 

15 used to measure size, surface structure and fluorescence of the cells. 

The results are presented as graphs in Figure 31. Cells transfected only 
with carrier DNA were used to measure the auto-fluorescence of the cell-line. 
1% of this auto-fluorescence was considered as background fluorescence and 
was subtracted later from the dIEGFP fluorescence. The received data was 

20 analyzed using Microsoft Excel program. 

Percentages of the dIEGFP expressing cells were calculated using 
cells transfected with the carrier only as a negative control for background fluo- 
rescence. As shown in Figure 33, the two vectors were maintained in the cells 
with different kinetics. 

25 The number of the d1 EGFP expressing cells was calculated taking the 

dilutions into consideration using cells transfected with the carrier only as a 
negative control for background fluorescence. As seen from figure 53, the 
plasmids expressing EBNA-1 and carrying EBNA-1 specific multimeric binding 
sites are maintained very efficiently in the transfected cells. At day 1 after 

30 transfection approximately 8 x 10 4 cells expressed EGFP. At day 8, in the case 
of maintenance vector (FREBNAdl EGFP), the number of the plasmid positive 
dIEGFP expressing cells had increased ten times to 8 x 10 5 . With the plasmid 
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lacking EBNA-1 expression (FRE2d1EGFP) or having no EBNA-1 binding 
sites, the number of plasmid positive cells was retained or in many cases de- 
creased. This fact reflects the mechanism for segregation/partitioning Epstein- 
Barr virus. Maintenance and segregation function by EBNA1 and EBNA-1 

5 binding sites provides maintenance function to the plasmid if EBNA-1 is ex- 
pressed and plasmid carries EBNA-1 binding sites. The same mechanism and 
the same components actually provide the segregation function to Epstein- 
Barr Vims in the latent phase of life-cycle. 

Similar results were obtained also in human embryonic cell line 293 and 

10 mouse cell line 3T6 (Figure 34). As a control for the maintenance for 293 and 
3T6 cells, S6HPV11 and 2wtFS, respectively, were used. 

6.13 EXAMPLE 13: THE IMMUNOGENICITY OF GTU-MULTIGENE 
VECTORS 

THE IMMUNOGENICITY OF GTU-1- MULTIGENE VECTORS 
15 The immunogenic^ of six different multi-gene vaccine constructs 

prepared in Example 12, i.e. GTU-1 -RNT, GTU-1-TRN, GTU-1 -RNT-CTL, 
GTU-1 -TRN-CTL, GTU-1 -TRN-optgag-CTL, and GTU-1 -TRN-CTL-optgag vec- 
tors were tested in mice. The vectors were transformed into TOP10 or 
DH5alpha cells, and the MegaPreps were prepared using commercial Qiagen 
20 columns. Endotoxins were removed with Pierce Endotoxin Removal Gel. 

The test articles were coated on 1 pm gold particles according to 
the instructions given by the manufacturer (Bio-Rad) with slight modifications. 
Balb/c mice were immunized with a Helios Gene Gun using a pressure of 400 
psi and 0.5 mg gold/cartridge. Mice were immunized three times at weeks 0, 1, 
25 and 3. Mice were sacrificed two weeks after the last immunization. 

Mice were divided into six test groups (5 mice/group), which re- 
ceived 3 x 1 pg DNA as follows: 
Group 1. GTU-1 -RNT 
Group 2. GTU-1 -TRN 
30 Group 3. GTU-1 -RNT-CTL 

Group 4. GTU-1 -TRN-CTL 
Group 5. GTU-1 -TRN-optgag-CTL 
Group 6. GTU-1 -TRN-CTL-optgag 



WO 02/090558 ^ 



PCT/FI02/00379 



79 

Group 7. Control mice immunized with empty gold particles not 
loaded with DNA. 

The humoral response was followed from tail-blood samples from 
each mouse. First pre-immunization sample was taken from anesthetized, mice 
5 before the first immunization was given. Second sample was taken from anes- 
thetized mice before the third immunization. At sacrifice, whole blood sample 
was used for white blood cell counting, and serum was collected for humoral 
immunity tests. 

The blood samples were tested for antibodies with ELISA using a 
10 standard procedure. Nunc Maxi Sorp plates were coated with 100 ng of Nef, 
Rev, Tat, Gag, CTL or E2 proteins, blocked and sera at a dilution of 1:100 
were added in duplicate wells for an overnight incubation. After washing, the 
plates were incubated for 2 hours with a diluted (1 :500) secondary antibody, 
peroxidase conjugated anti-mouse IgG (DAKO). Color intensity produced from 
15 the substrate (2,2'-azino-bis(3-ethylbenzthiazoline-6-suifonic acid) (ABTS) in 
phosphate-citrate buffer was measured at 405 nm using Labsystems ELISA- 
plate reader. 

All vectors induced Nef antibodies in all mice, whereas none of the 
mice showed E2, CTL or Rev antibodies (Fig. 35, 36, and Table 4). Some of 

20 the mice immunized with GTU-1 -RNT or GTLM-RNT-CTL also developed Tat 
antibodies (Fig. 36 and Table 4). Furthermore, mice immunized with vectors 
containing the optgag sequence developed also Gag antibodies, but the con- 
struct GTU-1 -TRN-optgag-CTL was a better antibody inducer that the con- 
struct GTU-1 -TRN-CTL-optgag (Figure 37 and Table 4). The antibodies in- 

25 duced were mainly of the lgG1 class indicating a Th2 type of response usually 
seen with gene gun immunization. The antibody assays shown below were 
done from the sera collected when mice were sacrificed. 

The results show that a multigene construct, expressing several HIV 
genes as a fusion protein, can induce an immune response to most of the 

30 gene products. The orientation and order of the genes in the multigene and 
corresponding proteins in the fusion proteins affects the results, however, 
dramatically. Thus, a response against Tat was seen only when the Tat gene 
was placed inside the fusion protein (vectors with RNT motif) and not when Tat 
was the amino terminal protein (vectors with the TRN motif). Response to the 

35 Gag proteins was seen only with the vector, where Gag was placed before the 
CTL containing a stretch of Th and CTL epitopes. 
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Table 4. 



Immuno III A mice ELISA results (OD mean of five mice) 





Immunogen 


Group 
numbe 

r 


Nef (own 
prot) 


Tat 


Rev 


Gag 


CTL 


GTU-1-RNT . 


1 


2.194 


1.391 


0.31 


0.155 


0.36 


GTU-1-TRN 


2 


1.849 


0.197 


0.252 


0.302 


0.38 


GTU-1-RNT- 
CTL 


3 


1.922 


0.555 


0.295 


0.154 


0.439 


GTU-1-TRN- 
CTL 


4 


1.677 


0.211 


0.298 


0.14 


0.425 


GTU-1-TRN- 
optgag-CTL 


5 


1.722 


0.182 


0.24 


0.667. 


0.381 


GTU-1-TRN- 
CTL-optgag 


6 


0.547 


0.225 


0.322 


0.228 


0,43 


Controls 


7 


0.316 


0.226 


0.282 


0.16 


0.405 




Immunogen 


Group 


Percent of 
Nef 

response 


Tat 

respon 
se 


Rev 

response 


Gag 

response 


CTL 

response 


GTU-1-RNT 


1 


100 


80 


0 


0 


0 


GTU-1-TRN 


2 


100 


0 


0 


20 


0 


GTU-1-RNT- 
CTL 


3 


100 


40 


0 


0 


0 


GTU-1-TRN- 
CTL 


4 


100 


0 


0 


0 


0 


GTU-1-TRN- 
optgag-CTL 


5 


80 


• 0 


0 


60 


0 


GTU-1-TRN- 
CTL-optgag 


e 


» 100 


C 


0 


20 


0 
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6.14. EXAMPLE 14: EXPRESSION OF HYBRID PROTEIN EX- 
PRESSING NEF, REV AND TAT IN DIFFERENT COMBINATIONS (MUL- 
TIREG) 

For the production of HIV multi-gene vectors, GTU-1 vector with a multi- 
5 cloning site (Figure 38A) was used as a backbone. Intact Nef, Rev and Tat 
coding sequences were amplified by the polymerase chain reaction (PCR) and 
attached to each other in various orders to multi-regulatory (multireg) antigen 
coding reading frames (Nef-Tat-Rev, Tat-Rev-Nef, Rev-Tat-Nef, Tat-Nef-Rev 
and Rev-Nef-Tat; Sequences Id. No. 1 to 5, respectively). These sequences 

10 were cloned to the Bsp1 1 91 and Notl sites of the GTU-1 vector. 

Similarly, Nef protein expressing GTU-2 and GTU-3 vectors (Figure 38B 
and 38C; see also Figure 6B for NNV-2wt)) were also used as backbones for 
the production of HIV multigene vectors. Additionally, the vector super6wt ex- 
pressing destabilized enhanced green fluorescent protein or dIEGFP .(su- 

15 per6wtd1EGFP; Figure 17 and Figure 38D) and plasmid utilizing the EBNA-1 
protein and its binding sites (FREBNAdlEGFP; Figure 38E) were used as a 
Gene Transfer Unit (GTU) platform. For control "non-GTU"vectors, a regular 
cytomegalovirus (CMV) vector NNV-Rev expressing Rev and a plasmid EBNA- 
1 and E2BS containing dIEGFP plasmid (NNV-Rev and E2BSEBNAd1 EGFP, 

20 respectively; Figures 38G and F) were used as backbones. 

For the preparation of different GTU-2 and GTU-3 vectors (pNRT, 
pTRN, pRTN, pTNR and pRNT; and p2TRN and p2RNT; and p3RNT, Figures 
39A-E, 39F-G and 39H, respectively),- the Nef gene in vectors GTU-2Nef and 
GTU-3Nef was substituted by the respective multireg antigen using Ndel and 

25 Pag I sites. The sequence of the letters N(ef), R(ev) and T(at) in the name 
shows the position of respective coding sequences of the protein in the multi- 
gene. Also two vectors, which contain the IRES element placed into the Sail 
sites following either the multi-antigen or E2 coding sequences, were pre- 
pared (pTRN-iE2-GMCSF and pTRN-iMG-GMCSF, respectively; Figures 39! 

30 and J). The latter sequence, which controls the translation of the coding se- 
quence of the mouse granulocyte-magrophage colony stimulating factor (GM- 
CSF), was cloned into the single BspTI site introduced with IRES. 
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Additionally, a set of the vectors, in which only immunodominant parts 
of the regulatory proteins were used for building up the polyproteins, were 
cloned into the Bsp119l and Notl sites of the GTU-1 (pMVINTR, pMV2NTR, 
pMV1N11TR and pMV2N11TR; Figures 40A-D). In case of the pMV2 con- 

5 structs, linkers that could be digested by intracellular proteases separate the 
regions of the multi-antigene derived from different regulatory proteins. 

Further, GTU-1, GTU-2 and GTU-3 vectors, which express the struc- 
tural proteins encoded by the gag gene or an artificial polyprotein composed 
by previously described CTL epitopes, were prepared. The coding sequences 

10 were cloned as Bsp119l and Not I digested PCR fragments into the GTU-1 
vector (pCTL = BNmCTL, pdgag = pBNdgag, psynp17/24 = pBNsynp17+24, 
poptp17/24= pBNoptp17/24; Figures 41A-D), and transferred in a Nde l-Pac I 
fragment to the GTU-2 (p2mCTL and p2optp17/24; Figures 41 E and F) and 
GTU-3 (p3mCTL and p3optp17/24; Figures 41 G and H). 

15 The coding segment designated as CTL (Sequence Id. No. 10) contains 

fragments from pol and env regions involving many previously identified CTL 
epitopes. The codon usage is optimized so that only codons used frequently in 
human cells are involved. This coding sequence also contains a well- 
characterized mouse CTL epitope used in potency assay and an epitope for 

20 recognition by anti-mouse CD43 antibody. Also, a dominant SIV p27 epitope 
was included for use in potency studies in macaques. 

The dgag contains truncated p17 (start at 13 aa ) +p24+p2+p7 (p1 and 
p6 are excluded) (Sequence Id. No. 11) of gag region of the Han2 isolate. The 
synp17/24 (Sequence Id. No. 12) codes for the p17+p24 polypeptide of the 

25 Han2 HIV-1. The codon usage is modified to be optimal in human cells. Also, 
previously identified AU rich RNA instability elements were removed by this 
way. The optp17/24 coding (Sequence Id. No. 13) region is very similar to the 
synp17/24 with the exception that the two synonymous mutations made 
therein do not change the protein composition but remove a potential splicing 

30 donor site. 

Further, a set of the multi-HIV vectors, which contain both a multireg an- 
tigen and structural antigens as a single polyprotein, were created: pTRN-CTL, 
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pRNT-CTL, pTRN-dgag, pTRN-CTL-dgag, pRNT-CTL-dgag, pTRN-dgag-CTL, 
pRNT-dgag-CTL, pTRN-optpl 7/24-CTL, pTRN-CTL-optp17/24, and pRNT- 
CTL-optp17/24; p2TRN-optp1 7/24-CTL, p2RNT-optp1 7/24-CTL, p2TRN-CTL- 
optp17/24, P 2RNT-CTL-optp17/24, p2TRN-CTL-optp17/24-iE2-mGMCSF, and 
5 p2RNT-CTL-optp17/24-iE2-mGMCSF; and p3TRN-CTL-optp17/24, p3RNT- 
CTL-optp17/24, p3TRN-CTL-optp17/24-iE2-mGMCSF, and p3RNT-CTL- 
optp17/24-iE2-mGMCSF, Figures 42A-T. 

For cloning, as a first step the STOP codon was removed from the regu- 
latory multi-antigen coding sequences. Then the structural antigen coding se- 
10 quences were added by cloning into the Notl site at the end of the frame so 
that a Notl site was .reconstituted. If both CTL and gag were added, the first 
antigen coding sequence was without the STOP codon. Generally, the clon- 
ings were made in context of GTU-1 and for making the respective GTU-2 
(p2...) and GTU-3 (p3...) vectors, the Nef gene in the plasmids GTU-2Nef and 
15 GTU-2Nef was replaced using sites for Ndel and Pag I. However, the RNT- 
optpl 7/24-CTL antigen was built up directly in GTU-2 vector. 

The HIV multi-antigen was cloned to the vectors super6wtd1EGFP and 
FREBNAdlEGFP instead of the dIEGFP using sites for Eco105l and Notl (su- 
per6wt-RNT-CTL-optp17/24 and FREBNA-RNT-CTL-optp17/24; Figures 43V 
20 and 42 U, respectively). If indicated, the IRES and mouse mGM-CSF were 
cloned into the GTU-2 and GTU-3 vectors behind the E2 coding sequence into 
the sites Mph1103l and Eco91l from pTRN-iE2-mGMCSF (cut out using same 
restrictases). 

Finally, "non-GTU" control vector E2BSEBNA-RNT-CTL-optp17/24 
25 (Figure 42W) for the system utilizing EBNA-1 (contains EBNA-1 expression 
cassette with E2 binding sites) was made in a similar way as the FREBNA- 
RNT-CTL-optp17/24. The regular CMV vector pCMV-RNT-CTL-optp17/24 ex- 
pressing the multi HIV antigen (Figure 42D) was made by cloning the multi-HIV 
coding fragment from respective GTU-1 vector using sites for Ndel and Pag I. 

30 
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6.15. EXAMPLE 15: EXPRESSION PROPERTIES OF THE MULTIREG 
ANTIGENS CARRYING ONLY IMMUNODOMINANT REGIONS OF THE 
REGULATORY PROTEINS. 

1 . Intracellular localization of the MultiREG antigens 

5 The intracellular localization of MultiREG antigens expressed by the 

vectors of the invention was studied by in situ immunofluorescence in RD cells 
using monoclonal antibodies against Nef, Rev and Tat proteins essentially as 
described in Example 4. The results are summarized in Table 5 and illustrated 
in figure 45. All antigens that are comprised of intact Nef, Rev and Tat proteins 

10 showed exclusive localization in cytoplasm. The aberrant protein initially de- 
signed as N(ef)T(at)R(ev), which has a frame-shift before the Rev sequence, 
showed only the nuclear localization. MultiREG antigens carrying truncated 
sequences of the regulatory proteins were localized in cytoplasm. In this cases 
distinct structures like "inclusion bodies" were frequently observed. The same 

15 was true for antigens, which carried the protease sites expressed from pMV2 
vectors. However in these cases the proteins in nucleus were also detected 
(Figure 45). 



WO 02/090558 



85 



PCT/FI02/00379 



Table 5. Intracellular localization in multireg antigens 
Construct anti-Nef anti-Rev 



anti-Tat 



empty 
GTU-1 ■ 

pTRN 
pNTR 
pRNT 
pNRT 
pRTN 
pTNR 
pMVINTR 



negative 

strong staining in 
cytoplasm 

strong staining in 
nucleus, nucleolus 

strong staining in 
cytoplasm 

strong, cytoplasmic 
strong, cytoplasmic 
strong, cytoplasmic 
strong, cytoplasmic 



pMV1N11 strong 

TR cytoplasmic+inclusions 

pMV2NTR inclusions in nuclei and 
in cytopi. 

pMV2N1 1 only inclusions, in nuclei 
TR and in cytopi 



negative 

good staining in 
cytoplasm 

negative 

good staining in 
cytoplasm 

good staining in 
cytoplasm 

good staining in 
cytoplasm 

good staining in 
cytoplasm 

cytoplasmic+inclusi 
ons 

cytoplasmic+inclusi 
ons 

inclusions in nuclei 
and cytoplasm 

only inclusions in 
nuclei and in cytopi 



negative 

positive staining in 
cytoplasm 

positive staining in. 
nucleus 

good staining in 
cytoplasm 

good staining in 
cytoplasm 

positive staining in 
cytoplasm 

good staining in 
cytoplasm 

cytoplasmic+inclusion 

s ; 

cytoplasmic+inclusion . 
s 

inclusions in nuclei 
and cytoplasm 

only inclusions in 
nuclei and in cytopi. 



WO 02/090558 



PCT/FI02/00379 



86 

The intracellular localization of dgag and p17+p24 proteins was also 
analyzed in RD cells by immunofluorescence with monoclonal anti p24 anti- 
bodies. In accordance with the Western blot results in Jurkat cells, the dgag 
could not be detected. However, the p17/24 protein showed localization in 
plasma membranes (Figure 45). The localization of CTL protein was not ana- 
lyzed, because no suitable antibody was available. 



6.2 EXAMPLE 15: ANALYSIS OF VECTORS ENCODING RECOM- 
10 BINANT GAG ANTIGENS AND CYTOTOXIC T-CELL EPITOPES (CTL) 

FROM POL 

6.2.1. EXPRESSION 

Analysis of expression of the vectors expressing CTL cds or proteins 
from the gag region were performed by western blot. As seen on Figure 46A 

15 and 46B, the CTL and dgag expression was clearly demonstrated in Cos-7 
cells as predicted size proteins (25kD and 47kD, respectively). The co- 
transfection of the Nef, Rev and Tat significantly enhanced the expression of 
the dgag protein. We interpret this as a result of REV protein action on the 
GAG mRNA expression We also tried to express the dgag protein from GTU-1 

20 vector in Jurkat cells, but we failed to detect any signal (Figure 46C). The 
analysis of the codon usage showed that wt GAG sequence had not optimal 
codon usage for human cells. When the codon usage was optimized (con- 
structs psynp17/24 and poptp17/24), strong p17+p24 (40kD) protein expres- 
sion was detected in Jurkat cells (Figure 46C and 46D). 

25 6.2.2. INTRACELLULAR LOCALIZATION 

For dgag and p17+p24 proteins, the intracellular localization was also 
analyzed in RD cells by immunofluorescence with anti p24 Mab. Similar to the 
western blot results in Jurkat cells, the dgag could not be detected. The p17/24 
protein showed localization in plasma membranes (Figure 47). The localization 

30 of CTL protein was not analyzed caused by lacking of suitable antibody. 
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6.16 EXAMPLE 16: MULTIREG+STRUCTURAL PROTEINS AS MUL- 
TIHIV ANTIGEN EXPRESSION 
5 As next step, the expression of the MultiHIV antigenes consisting of 

both, regular multigene together with gag encoded protein and/or CTL 
multiepitope as single polypeptide was analysed. On Figure 48, the Western 
blot shows the expression of several multiHIV-antigenes expressing vectors 
transfected to the Cos-7 cells. It is clearly seen that the expression levels of all 
10 regulatory+structural multi-antigenes are significantly lower than of the RNT or 
TRN proteins. All tested MultiHIV antigenes migrate in the gel as distinct bands 
near the position of predicted size (73kD for multireg+CTL; 95kD for mul- 
tireg+dgag and 120kD for multireg+CTL+dgag). Similar to the RNT and TRN, 
the RNT-CTL migrates more slowly than TRN-CTL. Also, in cases of both TRN 
15 and RNT constructs, the MultiREG-CTL-dgag combination showed higher ex- 
pression level than MultiREG-dgag-CTL 

More detailed analysis of the multiHIV antigenes was performed in Jur- 
kat cells. For this reason, most of the constructed MultiHIV antigenes (mul- 
tireg+structural), included the MultiREG+CTL+optp17/24 (with predicted size 
20 113kD) were analyzed by Western blotting using antibodies against different 
parts of the antigene. The results are presented on Figure 49 are principially 
similar to those were reported in previous section in case of Cos-7 cells. As it 
was seen in the previous experiments, the dgag containing multi-antigenes 
•express very low levels of the hybrid protein in Jurkat cells. The expression 
25 from the vector pTRNdgag was undetectable on all blots. In lanes loaded ma- 
terial from cells transfected with other dgag containing antigene expression 
vectors, very faint signals only on the Nef Mab hybridized blot were detected at 
positions of predicted sizes. In contrast, if the dgag part is replaced with the 
codon optimized p17/24, the expression level increase was observed. Be- 
30 cause the TRN-CTL-optp 17/24 and RNT-optp17/24 were initially chosen' for 
further analysis, the expression of the antigenes was analyzed from all GTU 
vectors containing these expression cassettes. Also, the E2 protein expression 
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from these plasmids was analyzed. The results are illustrated on Figure 50. 
There are no big differences between the vectors in expression levels of both 
multi-antigene and the E2 protein. The E2 expression level is not significantly 
influenced by presence of IRES element followed mouse GM-CSF gene in the 
5 plasmid, translated from the same mRNA as the E2. 

6.17. EXAMPLE 17: MAINTENANCE OF EXPRESSION OF ANTIGEN 

The maintenance of the plasmid in a population of dividing cells was 
proved using the green fluorescent protein and Nef protein as markers. The 

10 maintenance of the expression of the RNT-CTL-optp17/24 antigen produced 
from different GTU or non-GTU vectors was also analyzed. Specifically, GTU-1 
(p RNT-CTL-optp17/24), GTU-2 (p2 RNT-CTL-optp17/24), GTU-3 (p3 RNT- 
CTL-optp^^), super6wt (super6wt-RNT-CTL-optp17/24) vectors each utilize 
the E2 protein and its binding sites for the plasmid maintenance activity. In this 

15 experiment, also EBNA-1 and its binding site utilizing GTU vector FREBNA- 
RNT-CTL-optp17/24 was included. As negative controls, "non-GTU" plasmid 
containing a mixed pair of the EBNA-1 expression cassette together with E2 
binding sites (E2BSEBNA- RNT-CTL-optp 17/24) was used. Also, regular CMV 
expression vector pCMV- RNT-CTL-optp 17/24 was used. 

20 Jurkat cells were transfected with equimolar amounts of the plas- 

mids and the antigen expression was studied at 2 and 5 days post-transfection 
using a monoclonal anti-Nef antibodies. Transfection with carrier DNA only 
was used as a negative control. The results are presented in Figure 51. 

As it seen from figure 51, the expression is detectable only from 

25 GTU vectors at the second time-point. The antigen expression from the 
FREBNA- RNT-CTL-optp 17/24 was lower at both time-points, because, unlike 
E2, the EBNA-1 does not have transcription activation ability. 

Also the intracellular localization of the multireg+structural pblypro- 
teins was studied by in situ immunofluorescence analysis in RD cells essen- 

30 tially as described in Example 4. The results are presented in figure 52. 

In all cases localization only in cytoplasm was detected using either 
monoclonal anti-Nef or anti-p24 antibodies. In accordance with Western blot 
data, the expression level of optp17/24 containing proteins was much stronger 
than dgag fragment containing antigens. 
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Ail references cited herein are incorporated herein by reference in 
their entirety and for all purposes to the same extent as if each individual pub- 
lication or patent or patent application was specifically and individually indi- 
cated to be incorporated by reference in its entirety for all purposes. 

5 Many modifications and variations of this invention can be made 

without departing from its spirit'and scope, as will be apparent to those skilled 
in the art. The specific embodiments described herein are offered by way of 
example only, and the invention is to be limited only by the terms of the ap- 
pended claims along with the full scope of equivalents to which such claims 

10 are entitled. 
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WHAT IS CLAIMED IS: 

1 . An expression vector comprising: 

(a) a DNA sequence encoding a nuclear-anchoring protein 
operatively linked , to a heterologous promoter, said nu- 
clear-anchoring protein comprising (i) a DNA binding do- 
main which binds to a specific DNA sequence, and (ii) a 
functional domain that binds to a nuclear component, or a 
functional equivalent thereof; and . 

(b) a multimerized DNA sequence forming a binding site for 
the nuclear anchoring protein, 

wherein said vector lacks a papilloma virus origin of replication. 

2. The vector of claim 1 , wherein said vector lacks an origin of replica- 
tion functional in a mammalian cell. 

3, The vector of claim 1 or 2, wherein said nuclear component is mi- 
totic chromatin, the nuclear matrix, nuclear domain 10 (ND10), or nuclear do- 
main POD, 

4, The vector of claim 1 or 2, wherein said nuclear anchoring-protein is 
a chromatin-anchoring protein, and said functional domain binds mitotic chro- 
matin. 

.5. The vector of claim 1 or 2, wherein said nuclear-anchoring protein 
contains a hinge or linker region. 

6. The vector of claim 1 or 2, wherein said nuclear-anchoring protein is 
a natural protein of eukaryotic, prokaryotic, or viral origin. 

7. The vector of claim 6, wherein said natural protein is of viral origin. 

8. The vector of claim 6, wherein said nuclear-anchoring protein is a 
natural protein of eukaryotic origin. 

9. The vector of claim 1 or 2, wherein said nuclear-anchoring protein is 
that of a papilloma virus or an Epstein-Barr virus. 

10. The vector of claim 9, wherein said nuclear-anchoring protein is 
the E2 protein of Bovine Papilloma Virus type 1 or Epstein-Barr Virus Nuclear 
Antigen 1 . 
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11. The vector of claim 10, wherein said nuclear-anchoring protein is 
the E2 protein of Bovine Papilloma Virus type 1. 

12. The vector of claim 1 or 2, wherein said nuclear-anchoring protein 
is a High Mobility Group protein. 

5 13. The vector of claim 1 or 2, wherein said nuclear-anchoring protein 

is a non-natural protein. 

14. The vector of claim- 13, wherein said nuclear-anchoring protein is a 
recombinant protein, a fusion protein, or a protein obtained by molecular mod- 
eling techniques. 

10 15. The vector of claim 14, .wherein said recombinant protein, fusion 

protein, or protein obtained by molecular modeling techniques contains any 
combination of a DNA binding domain which binds to said specific DNA se- 
quence and a functional domain which binds to a nuclear component, wherein 
said functional domain which binds to a nuclear component is that of a papil- 

15 loma virus, an Epstein-Barr-Virus, or a High Mobility Group protein. 

16. The vector of claim 15, wherein said recombinant protein, fusion 
protein, or protein obtained by molecular modeling techniques contains any 
combination of a DNA binding domain which binds to said specific DNA se- 
quence and a functional domain which binds to a nuclear component, wherein 

20 said functional domain which binds to a nuclear component is that of E2 pro- 
tein of Bovine Papilloma Virus type 1, Epstein-Barr Virus Nuclear Antigen 1, or- 
a High Mobility Group protein. 

17. The vector of claim 1 or 2, wherein said vector further comprises 
one or more expression cassettes of a DNA sequence of interest. 

25 18. The vector of claim 17, wherein said DNA sequence of interest is 

that of an infectious pathogen. 

19. the vector of claim 18, wherein said infectious pathogen is a virus. 

20. The vector of claim 19, wherein said virus is selected from the 
group consisting of Human Immunodeficiency Virus (HIV), Herpex Simplex Vi- 

30 rus (HSV), Hepatitis C Virus, Influenzae Virus, and Enterovirus. 

21. The vector of claim 18, wherein said DNA sequence of interest is 
that of a bacterium. 
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22. The vector of claim 21 , wherein said bacterium is selected from the 
group consisting of Chlamydia trachomatis, Mycobacterium tuberculosis, and 
Mycoplasma pneumonia. 

23. The vector of claim 21 , wherein said bacterium is Salmonella. 

5 24. The vector of claim 17, wherein said DNA sequence of interest is 

that of a fungal pathogen. 

25. The vector of claim 24, wherein said fungal pathogen is Candida 

albigans. 

26. The vector of claim 20, wherein said DNA sequence of interest is 
10' of HIV origin. 

27. The vector of claim 26, wherein said DNA sequence of interest en- 
codes a non-structural regulatory protein of HIV. 

28. The vector of claim 27, wherein said non-structural regulatory pro- 
tein of HIV is Nef, Tat or Rev. 

15 29. The vector of claim 28, wherein said nonstructural regulatory pro- 

tein of HIV is Nef. ' 

30. The vector of claim 17, wherein said DNA sequence of interest en- 
codes a structural protein of HIV. . 

31 . The vector of claim 30, wherein said DNA sequence of interest is 
20 the gene encoding HIV gp120/gp1 60. 

32. The vector of claim 17, wherein a first said expression cassette 
comprises a DNA sequence of interest which encodes Nef, Tat or Rev, and 
wherein a second said expression cassette comprises a DNA sequence of in- 
terest which encodes Nef, Tat or Rev. 

25 33. The vector of claim 17, wherein a first said expression cassette 

comprises a DNA sequence of interest which encodes Nef, Tat or Rev, and 
wherein a second said expression cassette comprises a DNA sequence of in- 
terest which encodes a structural protein of HIV! 

34. The vector of claim 1 7, wherein the DNA sequence of interest en- 

30 codes a protein associated with cancer. 
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35. The vector of claim 17, wherein the DNA sequence of interest en- 
codes a protein associated with immune maturation, regulation of immune re- 
sponses, or regulation of autoimmune responses, 

36. The vector of claim 35, wherein said protein is APECED. 

5 37. The vector of claim 17, wherein the DNA sequence of interest is 

the Aire gene. 

38. The vector of claim 17, wherein the DNA sequence of interest en- 
codes a protein that is defective in any hereditary single gene disease. 

39. The vector of claim 17, wherein the DNA sequence of interest en- 
10 codes a macromoiecuiar drug. 

40. Jhe vector of claim 39, wherein the DNA sequence of interest en- 
codes a cytokine. 

41. The vector of claim 40, wherein said cytokine is an interleukin se- 
lected from the group consisting of IL1, 1L2, IL4, IL6 and IL12. 

15 42. The vector of claim 40, wherein the DNA sequence of interest en- 

codes an interferon. 

43. • The vector of claim 17, wherein said DNA sequence of interest en- 
codes a biologically active RNA molecule. 

44. The vector of claim 43, wherein said biologically active RNA mole- 
20 cule is selected from the group consisting of inhibitory antisense and ribozyme 

molecules. 

45. The vector of claim 44, wherein said inhibitory antisense or ri- 
bozyme molecules antagonize the function of an oncogene. 

46. The vector of claim 1 7 for use as a medicament. 

25 47. The vector of claim 17 for use as a carrier vector for a gene, 

genes, or a DNA sequence or DNA sequences of interest, such as a gene, 
genes, or a DNA sequence or DNA sequences encoding a protein or peptide 
of an infectious agent, a therapeutic agent, a macromoiecuiar drug, or any 
combination thereof. 

30 48. The vector of claim 17 for use as a medicament for treating inher- 

ited or acquired genetic defects. 
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49. The vector of claim 17 for use as a therapeutic DNA vaccine 
against an infectious agent. 

50. The vector of claim 1 7 for use as a therapeutic agent. 

51 . A vector of claim 1 7 for the use for production of a protein encoded 
5 by said DNA sequence of interest in a cell or an organism. 

52. A vector of claim 17 for the use for production of a therapeutic 
macromolecular agent in vivo, 

53. A method for providing a protein to a subject, said method 
comprising administering to the subject the vector of claim 1 or 2, wherein said 

10 vector (i) further comprises a second DNA sequence encoding the protein to 
be provided to the subject, which second DNA sequence is operably linked to 
a second promoter, and (ii) does not encode Bovine Papilloma Virus protein 
E1, and wherein said subject does not express Bovine Papilloma Virus protein 
El 

15 54. A method for inducing an immune response to a protein in a sub- 

ject, said method comprising administering to the subject the vector of claim 1 
or 2, wherein said vector (i) further comprises a second DNA sequence encod- 
ing said protein, which second DNA sequence is operably linked to a second 
promoter, and (ii) does not encode Bovine Papilloma Virus protein E1, and 

20 wherein said subject does not express Bovine Papilloma Virus protein E1 . 

55. A method for treating an infectious disease in a subject in need of 
said treatment, said method comprising administering to said subject a thera- 
peutically effective amount of the vector of claim 17, wherein said DNA se- 
quence of interest encodes a protein comprising an immunogenic epitope of 

25 an infectious agent. 

56. A method for treating an inherited or acquired genetic defect in a 
subject in need of said treatment, said method comprising: administering to 
said subject a therapeutically effective amount of the vector of claim 17, 
wherein said DNA sequence of interest encodes a protein which is affected by 

30 said inherited or acquired genetic defect. 

57. A method for expressing a DNA sequence in a subject, said 
method comprising administering the vector of claim 17 to said subject. 
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58. The method of claim 55, 56, or 57, wherein said vector does not 
encode Bovine Papilloma Virus protein E1, and wherein said subject does not 
express Bovine Papilloma Virus protein E1 . 

59/ A method for the preparation of a vector of claim 1 or 2 comprising: 
5 (a) cultivating a host cell containing said vector; and 

(b) recovering the vector. 

60. The method of claim 59, further comprising before step (a) a step, 
of transforming said host cell with said vector. 

61 . The method of claim 59, wherein said host cell is a prokaryotic cell. 
10 62. The method of claim 59, wherein said host cell is an Escherichia 

coli. 

63. A host cell, characterized by containing the vector of claim 1 or 2. 

64. A host cell, characterized by containing the vector of claim 1 7. 

65. The host cell of claim 63, wherein said host cell is a bacterial cell. 
15 66. The host cell of claim 63, wherein said host cell is a mammalian 

cell. 

67. A carrier vector containing the vector of claim 1 or 2. 

68. A pharmaceutical composition comprising the vector of claim 17 
and a suitable pharmaceutical carrier. 

20 69. A DNA vaccine containing the vector of claim 1 7. 

70. A DNA vaccine containing the vector of claim 1 8. 

71 . A DNA vaccine containing the vector of claim 1 9. 

72. A DNA vaccine containing the vector of claim 21 . 

73. A DNA vaccine containing the vector of claim 24. 

25 74. A gene therapeutic agent containing the vector of claim 17. ' 

75. A method for the preparation of the DNA vaccine of claim 69, said 
method comprising combining the vector of claim 17 with a suitable pharma- 
ceutical vehicle. 

76. A method for the preparation of the DNA vaccine of claim 69, said 
30 method comprising combining the vector of claim 18 with a suitable pharma- 
ceutical vehicle. 
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77. A method for the preparation of the DNA vaccine of claim 69, said 
method comprising combining the vector of claim 19 with a suitable pharma- 
ceutical vehicle. 

78. A method for the preparation of the DNA vaccine of claim 69, said 
5 method comprising combining the vector of claim 21 with a suitable pharma- 
ceutical vehicle. 

79. A method for the preparation of the DNA vaccine of claim 69, said 
method comprising combining the vector of claim 24 with a suitable pharma- 
ceutical vehicle. 

io 80. A method for the preparation of the agent of claim 74, said method 

comprising combining the vector of claim 17 with a suitable pharmaceutical 
vehicle. 



\ 



WO 02/090558 



1/91 



PCT/FI02/00379 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/FI02/00379 



2/91 




FIG. 2 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 n»i»s <a^-?» PCT/FI02/00379 



3/91 




FIG. 3 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



5/91 



PCT/FI02/00379 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



FIG. 6A 




FIG- 6B 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/FI02/00379 

7/91 




1 2 3 4 5 6 



FIG- 7 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 %§ %J PCT/FI02/00379 

8/91 




E2-S*- — «•* •, — 

E2 Mabs 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 

44h 114h 



FIG- 8 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 ^ ^ PCT/FI02/00379 

9/91 




Nef Wlabs 



E2 — > — 

. *. E2Mabs 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 

45h 119h 



FIG. 9 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 %f PCT/FI02/00379 

10/91 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 w „ _ PC T/FI02/00379 



11/91 




1 2 3 4 5 6 7 8 9 10 11 12 13 14 
39h 114h 

F!G|- 11 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



12/91 



PCT/FI02/00379 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



TP 



♦ f 

123456789 



FIG- 13 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 %f PCT/FI02/00379 

14/91 



CHO 



Jurkat 



n r 



M .M M . . 2#1 ,72h ?4h ?2h 2fr 




4? 



«f ^ ^ ^ 

M M M 9€h SSh S€h 



, w *'-~K r 




? • iff. 
■fe « Mr. 



1, 




1 2 3 4 5 




r:> .vV"- V.* 



4 i vr£ 5 * i " 



7 8 9 



FIG. 14 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



15/91 



#V #4? ^ 



HPV11 

M M I NNV-2wt |NNV-2wtFS»product1wtl ORI, M 




123456789 10 



FIG- 15 



SUBSTITUTE SHEET (RULE 26) 




PCT/FI02/00379 



FIG- 16 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



17/91 



PCT/FI02/00379 




FIG. 17 



SUBSTITUTE SHEET (RULE 26) 




PCT/FI02/00379 



FIG. 18 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 ^) PCT/FI02/00379 

20/91 



). 

to 




jm/siiao jo-ou 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 ^ ^ PCT/FI02/00379 

21/91 







in 






U- 


FP 




FP 


o 


CM 
<D 
O) 


CD 


UJ 


LU 








i 


O 








1 


CM 


cn 



o 

_ ... Yz 



\ ^ 



r "' - : > r --' i ,- 1 ^ " 




.•wis** 



HP 



j - - . , - xw '- V> - tviT. ." > \ Mil. ■' « 



Ms* 



CO 
CM 

T— 

"T— 
T — 

o 

CD 
00 
N- 
CD 
lO 

CO 
CM 



O 
O 
O 

o 
o 
o 
o 
o 
o 



o 
o 
o 
o 
o 
o 
o 
o 



g! 
o 
o 
o 
o 
o 



o 
o 
o 
o 
o 
o 



|UJ/S||SO J.O *<^U 



c 
o 

o 

CD 

c 
re 



o 

CL 

>> 

T3 



CM 

O 
LL 



o 
o 
o 
o 
o 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



22/91 



Dm 
U. 
O 

UJ 

-o IX. 
CN! <N 



Dl 
Li. 

CD 
UJ 



a 

Li. 

o 

LU 

> 



t 




CM 

6 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/F102/00379 



23/91 



CO 
LL 

a, q. 

Li- Li. 

to 



LU LU 
■a 



cn cvj cr 

f 4 ! 




ion 


CO 


o 


CM 










c 

F 3 


6 


st-tr 


Li- 


o 




a. 




w 




>* 








T3 





o to o 
to «^ Ta- 



rn o m o 

Ol CN V 



sjiao Buissajdxa dd£)3l.P J° % 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



24/91 





(O 






U- 


CL 


CL 


CL 


LL 


U- 


GF 


(D 


O 


ill 


LU 


UJ 










i 


i 










CM 


CM 






c 
o 

o 

</> 
c 



to 
o 
Q. 

>i 



g 

LL 



|iu/snao Buissajdxa dd93l-P J° " ou 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 P JP PCT/FI02/00379 

25/91 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 fl) PCT/FI02/00379 

26/91 



Proliferation response in GTU-Nef immunized patients (1 

ug group) 




Patient 1 
Patient 2 
Patient 3 
, — patient 4 
Patient 6 



FIG- 26 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 || J J PCT/FI02/00379 

27/91 



Proliferation response in GTU-Nef immunized 
patients (20 ug group) 




■Patients 
. P-atient 9 
Patient 10 
Patient 11 
Patient 12 



Weeks 



FIG- 27 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 || %J PCT/FI02/00379 

28/91 




FIG- 28 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



S9Q4.£colLY 



FIG. 29A 

Plasmid s6E2dlEGFP 




FIG. 29B 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 J) J PCT/FI02/00379 

30/91 




FIG- 29C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



31/91 



PCT/FI02/00379 




FIG- 30 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 M M PCT/FI02/00379 



32/91 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 || PCT/F102/00379 

33/91 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/FI02/00379 



34/91 




CD 



- 00 



CO 



- CM 



5 



00 

o 



OOOOOOOQOOO 

oooooooooo 
oooooooooo 

OOOOOOOOOO 

oooooooooo 
ooooh-coiOTj-cocvj^- 



co 

CO 



LL 



(Ui in sijeo 6uiss9jdx9 ddOBtP 3P ' ou 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



o5 300000 



u 

05 
C 

'en 
cn 

£ 

Q. 

8:- £ 
o 

UJ 

*B 

d 
c 



250000 
200000 
150000 
100000 
50000 
0 



35/91 

293(E2) 



•s6HPV1lOri 
■ FREBNA 




2 4 6 8 

days post transfection 



10 



FJG- 34A 




10000 -I , , 1 ■ — i 

0 1 2 3 4 5 



days post transfection 



FIG- 34B 



SUBSTITUTE SHEET (RULE 26) 



36/91 



PCT/FI02/00379 




FIG- 35 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 f§ || PCT/FI02/00379 

37/91 



Immuno II! Amice ELISA results 




1 2 3 4 5 6 7 
Group number 



STat response 
□ Rev response 



FIG. 36 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 || || PCT/F102/00379 

38/91 



Immuno III A mice EL IS A results 





0,8 - 




0,7 - 


£ 
c 


0,6 - 


0.5 - 


in 
o 


0,4 - 


a 


0,3 - 


o 






0,2- 




0,1 - 




0 - 



Hltii 



345 

Group num ber 



g|<3ag response 
ggCTL response 



FIG. 37 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



39/91 



PCT/FI02/00379 




FIG. 38 A 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 \g«P 0 PCT/FI02/00379 

40/91 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



41/91 



PCT/FI02/00379 




FIG. 38C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



42/91 



PCT/FI02/00379 




FIG- 38D 



SUBSTITUTE SHEET (RULE 26) 



i 



WO 02/090558 WW U PCT/FI02/00379 

43/91 




FIG- 38E 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/F102/00379 

44/91 




,B:o91! 1706 



FIG- 38F 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



45/91 




r^ph1103i 1997 



FIG. 38G 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



46/91 



PCT/FI02/00379 



tyjhl 1031 5816. 




FIG- 39A 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



47/91 



PCT/FI02/00379 



K/fch1103l 5817 




FIG- 39B 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



48/91 



PCT/FI02/00379 



MpM 1031 5817 




FIG- 39C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



49/91 



PCT/FI02/00379 



l^ph1103l 5616 




FIG- 39D 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



50/91 



PCT/FI02/00379 



kfeh1103l 5816. 




FIG. 39E 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



51/91 



PCT/FI02/00379 




FIG- 39F 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT7FI02/00379 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



53/91 




MpW 1031 1999 



Eco91!3993 



FIG. 39H 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



54/91 



Mph1103l 




FIG- 39 I 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



55/91 



PCT/FI02/00379 



f^h1l03l 




Eco91l4140 



FIG- 39J 



SUBSTITUTE SHEET (RULE 26) 




PCT/FI02/00379 



FIG. 40A 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



57/91 



PCT/FI02/00379 



Mph1103l 5732. 




FIG- 40B 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



58/91 



PCT/FI02/00379 




FIG- 40C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



59/91 



PCT/FI02/00379 




FIG- 40D 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



Mph 11031 5219, 




PCT/FI02/00379 



FIG. 41A 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 — — PCT/FI02/00379 

61/91 




FIG- 41B 



SUBSTITUTE SHEET (RULE 26) 



wo 




PCT/FI02/00379 



FIG. 41 C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



63/91 



PCT/FI02/00379 




FIG- 41D 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



64/91 



PCT/FI02/00379 




FIG. 41E 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 PCT/FI02/00379 

65/91 




lYfchl 1031 3682 



FIG. 41F 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



66/91 



PCT/FI02/00379 




'Nfcih1103i 1999 



FIG- 41G 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



67/91 



PCT/FI02/00379 



PagI 7029 




Nfchl 1031 1999 



Eco91l3993 



FIG- 41H 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



68/91 



PCT/FI02/00379 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



69/91 



PCT/FI02/00379 



Mph1103l 




Bso91l4219 



FIG- 42B 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



70/91 



PCT/FI02/00379 




FIG- 42C 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



71/91 



PCT/FI02/00379 




2366 



FIG- 42D 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



73/91 



PCT/FI02/00379 




FIG. 42F 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 _ _ PCT/FI02/00379 

74/91 




FIG- 42G 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



75/91 



PCT/FI02/00379 




FIG. 42H 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



76/91 




FIG. 42 S 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



77/91 



PCT/FI02/00379 




FIG- 42J 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



78/91 



PCT/FI02/00379 



Pagl 8957 




FIG. 42K 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



79/91 



PCT/FI02/00379 

t 



Pagl 8957 




FIG. 42 L 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



80/91 




Bsp1 191 5779 




pUCori 



62O1051 5397 
Ndel 5292 



fcfch1103! 3682 



FIG- 42M 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



81/91 




FIG. 42N 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



82/91 




FSG. 42 O 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 




PCT/FI02/00379 



83/91 




FIG- 42P 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



84/91 




,^h1 1031 1999 



FIG. 42Q 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 ^ ^ r PCT/FI02/00379 

85/91 




Wfch1 1031 1999 



FIG. 42R 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



86/91 



Pagl9556 




BspTl5165 



FIG. 42S 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 W W PCT/FI02/00379 

87/91 




N/tih1103l19S 



BapTl 5165 



FIG- 42T 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



88/91 



PCT/FI02/00379 



Sail 15 




6138 



NdeI6122 
Ndei 6082 



FIG- 42U 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 — ' W PCT/FI02/00379 

89/91 



. Sail 15 
Not! 8706I .Rag! 150 



B:o91l7430 



Bsp1191 5678 




B:o91t2277 



Sail 2856 
^h1 1031 2870 



6:o1 051 5296 
r4de»5191 



~] | Ragl3885 
f^M 1031 4479 Mph1 1031 4213 



FIG. 42V 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



90/91 




FIG- 42W 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



91/91 



PCT/FI02/00379 



B=o9115118, 




FIG. 42X 



SUBSTITUTE SHEET (RULE 26) 



WO 02/090558 



PCT/FI02/00379 



' , SEQUENCE LISTING 

<1'10> FIT Biotech" Oyj Pic 

<120> Novel expression vectors and uses thereof 

<130><- 'il041-006-228 (2010149) 

<140> To be assigned 
<141>' 2.002-05-03 

<150> FI -20010922 
<151> 2001-05-03 

<160> 52 - 

<170> Patentln version 3.1 

<210> 1 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 
• <220> 

<223> Hybrid protein comprised of Nef-Tat-Rev (NTR) 
<400> 1 

atggtgggca agtggtcaaa atgtagtgga tggcctactg taagggaaag aatgaaacaa 60 

. gctgagcctg agccagcagc agatggggtg ggagcagcat ctcgagacct ggaaaaacat 120 

ggagcaatca caagtagcaa tacagcaact aataacgctg cttgtgcctg gctagaagca 180 

caagaggaag aggaagtggg ttttccagtc agacctcagg tacctttaag accaatgact 240 

tacaagggag ctttagatct tagccacttt ttaaaagaaa aggggggact ggaagggtta 300 

atttactccc caaaaagaca agagatcctt gatctgtggg tctaccacac acaaggctac 360 

ttccctgatt ggcagaacta cacaccaggg ccaggggtca- gatatccact gacctttgga 420 

tggtgcttca agttagtacc agttgaacca gatgaagaag agaacagcag cctgttacac 480 

cctgcgagcc tgcatgggac agaggacacg gagagagaag tgttaaagtg gaagtttgac 540 

agccatctag catttcatca caaggcccga gagctgcatc cggagtacta caaagactgc 600 

actagtgcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt cagactcatc 660 

aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca ggcccgaaga 720 

aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgattag tgagcggatt 780 

cttagcactt ttctgggacg acctgcggag cctgtgcctc ttcagctacc gccgcttgag 840 

agacttactc ttgattgtag cgaagattgt ggaaactctg ggacgcaggg ggtgggaagt 900 

cctcaagtat tggtggaatc tcctgcagta ttggagccag gaactaaaga aaagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 

accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgattag 1260 

<210> 2 
<211> 1260 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Tat-Rev-Nef (TRN) 
<400> 2 / 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 
ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 
aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 
gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 24 0 
ccgacaggco cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 
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gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga . 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 780 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac j 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgctga \ 1260 

<210> 3 
<211> 1260 

<212> DNA , 
<213> Artificial Sequence 

<220> 

•<223> Hybrid protein comprised of Rev-Tat -Nef (RTN) 
<400> 3 

atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag actcatcaag 60 

tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 
cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 
agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 
cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgagcca 360 

gtagatccta gactagagcc ctggaagcat ccaggaagtc agcctaggac cccttgtacc 420 

aattgctatt gtaaaaagtg ttgccttcat tgccaagttt gtttcacaag aaaaggctta 480 

ggcatctcct atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag 540 

actcatcaag tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc 600 

ccgaagaaat cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgataagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 780 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc .1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080. 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgctga 1260 

<210> 4 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Tat-Nef-Rev (TNR) 
<400> 4 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg tgggcaagtg gtcaaaatgt agtggatggc ctactgtaag ggaaagaatg 360 
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aaacaagctg ' agcctgagcc agcagcagat ggggtgggag cagcatctcg agacctggaa 420 

aaacatggag caatcacaag tagcaataca gcaactaata acgctgcttg tgcctggcta 480 

gaagcacaag aggaagagga agtgggtttt ccagtcagac ctcaggtacc tttaagacca 540 

atgacttaca "agggagcttt agatcttagc cactttttaa aagaaaaggg gggactggaa 600 

gggttaattt actccccaaa aagacaagag atccttgatc tgtgggtcta ccacacacaa 660 

ggctacttec ctgattggca gaactacaca ccagggccag gggtcagata tccactgacc 720 

tttggatggt gcttcaagtt agtaccagtt gaaccagatg aagaagagaa cagcagcctg 780 

ttacaccctg cgagcctgca tgggacagag gacacggaga gagaagtgtt aaagtggaag 840 

tttgacagcc atctagcatt tcatcacaag gcccgagagc tgcatccgga gtactacaaa 900 

gactgcaagc ttgcaggaag aagcggagac agcgacgaag agctcctcaa gacagtcaga 960 

ctcatcaagt .ttctctacca aagcaaccct cctcccagca acgaggggac ccgacaggcc 1020 

cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc gattagtgag 1080 

cggattctta gcacttttct gggacgacct gcggagcctg tgcctcttca gctaccgccg 1140 

cttgagagac ttactcttga ttgtagcgaa gattgtggaa actctgggac gcagggggtg 1200 

ggaagtcctc aagtattggt ggaatctcct gcagtattgg agccaggaac taaagaatag 1260 

<210> 5 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Rev-Nef-Tat (RNT) 
<400> 5 

atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag actcatcaag 60 

tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 

cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt '180 

agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 240 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct .300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 

aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 

ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagt agate ctagactaga gccctggaag catccaggaa gtcagectag gaccccttgt 1020 

accaattget attgtaaaaa gtgttgcctt cattgecaag tttgtttcac aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagegg agacagegae gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggeccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgattag 1260 

<210> 6 
<211> 1164 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Protein comprised of Immunodominant parts of the Nef -Tat-Rev (NTR) 
<400> 6 

atgggatggc ctactgtaag ggaaagaatg aaacaagctg agcctgagcc agcagcagat 60 

ggggtgggag cagcatctcg agacctggaa aaacatggag caatcacaag tagcaataca 120 

gcaactaata acgctgcttg tgcctggcta gaagcacaag aggaagagga agtgggtttt 180 

ccagtcagac ctcaggtacc tttaagacca atgacttaca agggagcttt agatcttagc 240 

cactttttaa aagaaaaggg gggactggaa gggttaattt actccccaaa aagacaagag 300 

atccttgatc tgtgggtcta ccacacacaa ggctacttec ctgattggca gaactacaca 360 

ccagggccag gggtcagata tccactgacc tttggatggt gcttcaagtt agtaccagtt 420 
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gaaccagatg aagaagagaa cagcagcctg ttacaccctg cgagcctgca tgggacagag 480 

gacacggaga gagaagtgtt aaagtggaag tttgacagcc atctagcatt tcatcacaag 540 

gcccgagagc tgcatccgga gtactacaaa gactgcgctc tggccgccgt tgagccagta 600 

gatcctagac tagagccctg gaagcatcca ggaagtcagc ctaggacccc ttgtaccaat 660 

tgctattgta aaaagtgttg ccttcattgc caagtttgtt tcacaagaaa aggcttaggc 720 

atctcctatg gcaggaagaa gcggagacag cgacgaagag ctcctcaaga cagtcagact 780 

catcaagttt ctctaccaaa gcaaccctcc tcccagcaac gaggggaccc gacaggcccg 840 

aagaaatccg gactggccat cctgctgagc gacgaagagc tcctcaagac agtcagactc 900 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 960 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 1020 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 1080 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 1140 

agtcctcaag tattggtgga atga 1164 

<210> 7 

<211> 1173 

<212> DNA 

<213> Artificial Sequence 
c220> 

<223> Protein comprised of Immunodominant parts of the Nef-Tat-Rev 
separated by protease sites (NTR) 

<400> 7 

atgggatggc ctactgtaag ggaaagaatg aaacaagctg agcctgagcc agcagcagat 
ggggtgggag cagcatctcg agacctggaa aaacatggag caatcacaag tagcaataca 
gcaactaata acgctgcttg tgcctggcta gaagcacaag aggaagagga agtgggtttt 
ccagtcagac ctcaggtacc tttaagacca atgacttaca agggagcttt agatcttagc 
cactttttaa aagaaaaggg gggactggaa gggttaattt actccccaaa aagacaagag 
atccttgatc tgtgggtcta ccacacacaa ggctacttcc ctgattggca gaactacaca 
ccagggccag gggtcagata tccactgacc tttggatggt gcttcaagtt agtaccagtt 
gaaccagatg aagaagagaa cagcagcctg ttacaccctg cgagcctgca tgggacagag 
gacacggaga gagaagtgtt aaagtggaag tttgacagcc atctagcatt tcatcacaag 
gcccgagagc tgcatccgga gtactacaaa gactgcgctc tggccttcaa gcgggttgag 
ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 
accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc 
ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 
cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 
ggcccgaaga aatccgtacg ggagaagcgg ctgctgagcg acgaagagct cctcaagaca 
gtcagactca tcaagtttct ctaccaaagc aaccctcctc ccagcaacga ggggacccga 
caggcccgaa gaaatcgaag aagaaggtgg agagagagac agaggcagat ccgttcgatt 
agtgagcgga ttcttagcac ttttctggga cgacctgcgg agcctgtgcc tcttcagcta 
ccgccgcttg agagacttac tcttgattgt agcgaagatt gtggaaactc tgggacgcag 
ggggtgggaa gtcctcaagt attggtggaa tga 

<210> 8 

<211> 1161 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Protein comprised of Immunodominant parts of the regulatory proteins 
Nef-Tat-Rev started from aal of Nef (N11TR) 

<400> 8 

atgtggccta ctgtaaggga aagaatgaaa caagctgagc ctgagccagc agcagatggg 60 

gtgggagcag catctcgaga cctggaaaaa catggagcaa tcacaagtag caatacagca 120 

actaataacg ctgcttgtgc ctggctagaa gcacaagagg aagaggaagt gggttttcca 180 

gtcagacctc aggtaccttt aagaccaatg acttacaagg gagctttaga tcttagccac 240 

tttttaaaag aaaagggggg actggaaggg ttaatttact ccccaaaaag acaagagatc 300 

cttgatctgt gggtctacca cacacaaggc tacttccctg attggcagaa ctacacacca 360 

gggccagggg tcagatatcc actgaccttt ggatggtgct tcaagttagt accagttgaa 420 

ccagatgaag aagagaacag cagcctgtta caccctgcga gcctgcatgg gacagaggac 480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1173 
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acggagagag aagtgttaaa gtggaagttt gacagccatc tagcatttca tcacaaggcc 
cgagagctgc atccggagta ctacaaagac tgcgctctgg ccgccgttga gccagtagat 
cctagactag agccctggaa gcatccagga agtcagccta ggaccccttg taccaattgc 
tattgtaaaa agtgttgcct tcattgccaa gtttgtttca caagaaaagg cttaggcatc 
tcctatggca ggaagaagcg gagacagcga cgaagagctc ctcaagacag tcagactcat 
caagtttctc taccaaagca accctcctcc cagcaacgag gggacccgac aggcccgaag 
aaatccggac tggccatcct gctgagcgac gaagagctcc tcaagacagt cagactcatc 
aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca ggcccgaaga 
aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgattag tgagcggatt 
cttagcactt ttctgggacg acctgcggag cctgtgcctc ttcagctacc gccgcttgag 
agacttactc ttgattgtag cgaagattgt ggaaactctg ggacgcaggg ggtgggaagt 
cctcaagtat tggtggaatg a 

<210> 9 

<211> 1170 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Protein comprised of Immunodominant parts of the regulatory proteins 
Nef-Tat-Rev started from aal of Nef separated by protease sites (N11TR) 

<400> 9 

atgtggccta ctgtaaggga aagaatgaaa caagctgagc ctgagccagc agcagatggg 
gtgggagcag catctcgaga cctggaaaaa catggagcaa tcacaagtag caatacagca 
actaataacg ctgcttgtgc ctggctagaa gcacaagagg aagaggaagt gggttttcca 
gtcagacctc aggtaccttt aagaccaatg acttacaagg gagctttaga tcttagccac 
tttttaaaag aaaagggggg actggaaggg ttaatttact ccccaaaaag acaagagatc 
cttgatctgt gggtctacca cacacaaggc tacttccctg attggcagaa ctacacacca 
gggccagggg tcagatatcc actgaccttt ggatggtgct tcaagttagt accagttgaa 
ccagatgaag aagagaacag cagcctgtta caccctgcga gcctgcatgg gacagaggac 
acggagagag aagtgttaaa gtggaagttt gacagccatc tagcatttca tcacaaggcc 
cgagagctgc atccggagta ctacaaagac tgcgctctgg ccttcaagcg ggttgagcca 
gtagatccta gactagagcc ctggaagcat ccaggaagtc agcctaggac cccttgtacc 
aattgctatt gtaaaaagtg ttgccttcat tgccaagttt gtttcacaag^ aaaaggctta 
ggcatctcct atggcaggaa gaagcggaga cagcgacgaa gagctcctca' agacagtcag 
actcatcaag tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc 
ccgaagaaat ccgtacggga gaagcggctg ctgagcgacg aagagctcct caagacagtc 
agactcatca agtttctcta ccaaagcaac cctcctccca gcaacgaggg gacccgacag 
gcccgaagaa atcgaagaag aaggtggaga gagagacaga ggcagatccg ttcgattagt 
gagcggattc ttagcacttt tctgggacga cctgcggagc ctgtgcctct tcagctaccg 
ccgcttgaga gacttactct tgattgtagc gaagattgtg gaaactctgg gacgcagggg 
gtgggaagtc ctcaagtatt ggtggaatga 

<210> 10 
<211> 663 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Protein comprised of Cytotoxic T-cell epitopes of Pol and Env genes 
(CTL) 

<400> 10 

atgatcaccc tgtggcagcg ccccctggtg gccctgatcg agatctgcac cgagatggag 60 
aaggagggca agatcagcaa gatcggcccc gccggcctga agaagaagaa gagcgtgacc 120 
gtgctggacg tgggcgacgc ctacttcagc gtgcccctgg ataaggactt ccgcaagtac 180 
accgccttca ccatccccag catctggaag ggcagccccg ccatcttcca gagcagcatg 240 
accaagaagc agaaccccga catcgtgatc taccagtaca tggacgacct gtacgtgccc 3 00 
atcgtgctgc ccgagaagga cagctggctg gtgggcaagc tgaactgggc cagccagatc 360 
tacgccggca tcaaggtgaa gcagctgatc ctgaaggagc ccgtgcacgg cgtgtacgag 420 
cccatcgtgg gcgccgagac cttctacgtg gacggcgccg ccaaccgcgc cggcaacctg 480 
tgggtgaccg tgtactacgg cgtgcccgtg tggaaggagg ccaccaccac cctggtggag 540 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1161 



60 
120 
180 
•240 
•300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1170 



-5- 



WO 02/090558 



PCT/FI02/00379 



cgctacctgc gcgaccagca gctgctgggc atctggggct gcgcctgcac cccctacgac 600 
atcaaccaga tgctgcgcgg ccctggccgc gccttcgtga ccatccgcca gggcagcctg 660 
tag 663 



<210> 11 

<211> 1266 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Truncated Gag protein sequence (dgag) 



<400> 11 

atgttagaca aatgggaaaa aattcggtta aggccagggg gaaagaaaaa atatcaatta 
aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta 
gaaacatcag aaggctgtag acagataatg ggacagctac aaccgtccct tcagacagga 
tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaaag 
atagaggtaa aagacaccaa ggaagcttta gacaaggtag aggaagagca aaacaacagt 
aagaaaaagg cacagcaaga agcagctgac gcaggaaaca gaaaccaggt cagccaaaat 
taccctatag tgcaaaacct acagggacaa atggtacatc aggccatatc acctagaact 
ttaaatgcat gggtaaaagt agtggaagag aaggctttca gcccagaagt aatacccatg 
ttttcagcat tatcagaagg agccacccca caagatttaa acaccatgct aaacacagtg 
gggggacatc aagcagccat gcaaatgtta aaagaaacca tcaatgagga agctgcagaa 
tgggatagat tgcacccagt gcatgcaggg cctattgcac caggccagat gagagaacca 
aggggaagtg acatagcagg aactactagt acccttcagg aacaaatagg atggatgaca 
aataatccac ctatcccagt aggagaaata tataagagat ggataatcct gggattaaat 
aaaatagtaa gaatgtatag ccctaccagc attctggata taaaacaagg accaaaagaa 
ccctttagag attatgtaga ccggttctat aaaaccctaa gagccgagca agctacacag 
gaagtaaaaa attggatgac agaaaccttg ttggtccaaa atgcgaatcc agattgtaag 
actattttaa aagcattagg accagcagct acactagaag aaatgatgac agcatgtcag 
ggagtggggg gacccggcca taaagcaaga gttttggctg aagcaatgag ccaagtaaca 
ggttcagctg ccataatgat gcagagaggc aattttagga accaaagaaa gactgttaag 
tgtttcaatt gtggcaaaga agggcacata gccagaaatt gcagggcccc taggaaaaag 
ggctgttgga aatgtggaaa ggaaggacat caaatgaagg attgcacaga aagacaggct 
aattag 

<210> 12 
<211> 1092 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding sequence for pl7/24 protein of Gag gene (syn 17/24) 
<400> 12 

atgggcgcaa gagcctccgt gctgagcggc ggagagctgg acaagtggga gaagatccgc 60 

ctgcgccccg gcggcaagaa gaagtaccag ctgaagcaca tcgtgtgggc cagccgcgag 120 

ctggagcgct tcgccgtgaa ccccggcctg ctcgagacca gcgaaggctg ccgccagatc 180 

atgggccagc tccagcccag cctccagacc ggcagcgagg agctgcgcag cctgtacaac 240 

accgtggcca ccctgtactg cgtgcaccag aagatcgagg tgaaggacac caaggaggcc 300 

ctggacaagg tggaggagga gcagaacaac agcaagaaga aggcccagca ggaggccgcc 360 

gacgccggca accgcaacca ggtgagccag aactacccca tcgtgcagaa cctgcagggc 420 

cagatggtgc accaggccat cagcccccgc accctgaacg cctgggtgaa ggtggtggag 480 

gagaaggcct tcagccccga ggtgatcccc atgttcagcg ccctgagcga gggcgctacc 540 

ccccaggacc tgaacaccat gctgaacacc gtgggcggcc accaggccgc catgcagatg 600 

ctgaaggaga ccatcaacga ggaggccgcc gagtgggacc gcctgcaccc cgtgcacgcc 660 

gggcccatcg cccccggcca gatgcgcgag ccccgcggca gcgacatcgc cggcaccacc 720 

agcaccctcc aggagcagat cggctggatg accaacaacc cccccatccc cgtgggcgag 780 

atctacaagc gctggatcat cctgggcctg aacaagatcg tccgcatgta cagccccacc 840 

agcatcctgg acatcaagca gggccccaag gagcccttcc gcgactacgt ggaccgcttc 900 

tacaagaccc tgcgcgccga gcaggccacc caggaggtga agaactggat gaccgagacc 960 

ctgctggtgc agaacgccaa ccccgactgc aagaccatcc tcaaggccct gggacccgcc 1020 

gccaccctgg aggagatgat gaccgcctgc caaggcgtgg gcggccccgg ccacaaggcc 1080 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
780 
840 
900 
960 
1020 
1080 
1140 
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cgcgtgctgt ga 1092 

<210> 13 
<211> 1092 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding sequence for pl7/24 protein of Gag gene optimized 
for expression in eukaryotic cells (optpl7/24) 

<400> 13 

atgggcgcaa gagcctccgt gctgagcggc ggagagctgg acaagtggga gaagatccgc 60 

ctgcgccccg gcggcaagaa gaagtaccag ctgaagcaca tcgtgtgggc cagccgcgag 120 

ctggagcgct tcgccgtgaa ccccggcctg ctcgagacca gcgaaggctg ccgccagatc 180 

atgggccagc tccagcccag cctccagacc ggcagcgagg agctgcgcag cctgtacaac 240 

accgtggcca ccctgtactg cgtgcaccag aagatcgagg tgaaggacac caaggaggcc 300 

ctggacaagg tggaggagga gcagaacaac agcaagaaga aggcccagca ggaggccgcc 360 

gacgccggca accgcaacca agtcagccag aactacccca tcgtgcagaa cctgcagggc 420 

cagatggtgc accaggccat cagcccccgc accctgaacg cctgggtgaa ggtggtggag 480 

gagaaggcct tcagccccga ggtgatcccc atgttcagcg ccctgagcga gggcgctacc 540 

ccccaggacc tgaacaccat gctgaacacc gtgggcggcc accaggccgc catgcagatg 600 

ctgaaggaga ccatcaacga ggaggccgcc gagtgggacc gcctgcaccc cgtgcacgcc 660 

gggcccatcg cccccggcca gatgcgcgag ccccgcggca gcgacatcgc cggcaccacc 720 

agcaccctcc aggagcagat cggctggatg accaacaacc cccccatccc cgtgggcgag 780 

atctacaagc gctggatcat cctgggcctg aacaagatcg tccgcatgta cagccccacc 840 

agcatcctgg acatcaagca gggccccaag gagcccttcc gcgactacgt ggaccgcttc 900 

tacaagaccc tgcgcgccga gcaggccacc caggaggtga agaactggat gaccgagacc 960 

ctgctggtgc agaacgccaa ccccgactgc aagaccatcc tcaaggccct gggacccgcc 1020 

gccaccctgg aggagatgat gaccgcctgc caaggcgtgg gcggccccgg ccacaaggcc 1*080 

cgcgtgctgt ga " ~ ~ 3 1092 

<210> 14 
<211> 1926 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef and CTL (TRN-CTL) 
<400> 14 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 780 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaecagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgcgcg 1260 

gccgtcatca ccctgtggca gcgccccctg gtggccctga tcgagatctg caccgagatg 1320 
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gagaaggagg gcaagatcag caagatcggc cccgccggcc tgaagaagaa gaagagcgtg 1380 

accgtgctgg acgtgggcga cgcctacttc agcgtgcccc tggataagga cttccgcaag 1440 

tacaccgcct tcaccatccc cagcatctgg aagggcagcc ccgccatctt ccagagcagc 1500 

atgaccaaga agcagaaccc cgacatcgtg atctaccagt acatggacga cctgtacgtg 1560 

cccatcgtgc tgcccgagaa ggacagctgg ctggtgggca agctgaactg ggccagccag 1620 

atctacgccg gcatcaaggt gaagcagctg atcctgaagg agcccgtgca cggcgtgtac 1680 

gagcccatcg tgggcgccga gaccttctac gtggacggcg ccgccaaccg cgccggcaac 1740 

ctgtgggtga ccgtgtacta cggcgtgccc gtgtggaagg aggccaccac caccctggtg 1800 

gagcgctacc tgcgcgacca gcagctgctg ggcatctggg gctgcgcctg caccccctac 1860 

gacatcaacc agatgctgcg cggccctggc cgcgccttcg tgaccatccg ccagggcagc 1920 

ctgtag 1926 

<210> 15 
<211> 1926 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein cds comprised of Rev-Nef-Tat and CTL (RNT-CTL) 



<400> 15 

atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag actcatcaag 60 

tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 

cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 180 

agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 240 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 

aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 

ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 

accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgatgcg 1260 

gccgtcatca ccctgtggca gcgccccctg gtggccctga tcgagatctg caccgagatg 1320 

gagaaggagg gcaagatcag caagatcggc cccgccggcc tgaagaagaa gaagagcgtg 1380 

accgtgctgg acgtgggcga cgcctacttc agcgtgcccc tggataagga cttccgcaag 1440 

tacaccgcct tcaccatccc cagcatctgg aagggcagcc ccgccatctt ccagagcagc 1500 

atgaccaaga agcagaaccc cgacatcgtg atctaccagt acatggacga cctgtacgtg 1560 

cccatcgtgc tgcccgagaa ggacagctgg ctggtgggca agctgaactg ggccagccag 1620 

atctacgccg gcatcaaggt gaagcagctg atcctgaagg agcccgtgca cggcgtgtac 1680 

gagcccatcg tgggcgccga gaccttctac gtggacggcg ccgccaaccg cgccggcaac 1740 

ctgtgggtga ccgtgtacta cggcgtgccc gtgtggaagg aggccaccac caccctggtg 1800 

gagcgctacc tgcgcgacca gcagctgctg ggcatctggg gctgcgcctg caccccctac 1860 

gacatcaacc agatgctgcg cggccctggc cgcgccttcg tgaccatccg ccagggcagc 1920 

ctgtag 1926 

<210> 16 
<211> 2529 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef and truncated Gag 
protein (TRN-dgag) 
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<400> 16 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 3 60 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 780 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgcgcg 1260 

gccgtgttag acaaatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatatcaa 1320 

ttaaaacata tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg 1380 

ttagaaacat cagaaggctg tagacagata atgggacagc tacaaccgtc ccttcagaca 1440 

ggatcagaag aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa 1-500 

aagatagagg taaaagacac caaggaagct ttagacaagg tagaggaaga gcaaaacaac 1560 

agtaagaaaa aggcacagca agaagcagct gacgcaggaa acagaaacca ggtcagccaa 1620 

aattacccta tagtgcaaaa cctacaggga caaatggtac atcaggccat atcacctaga 1680 

actttaaatg catgggtaaa agtagtggaa gagaaggctt tcagcccaga agtaataccc 1740 

atgttttcag cattatcaga aggagccacc ccacaagatt taaacaccat gctaaacaca 1800 

gtggggggac atcaagcagc catgcaaatg ttaaaagaaa ccatcaatga ggaagctgca i860 

gaatgggata gattgcaccc agtgcatgca gggcctattg caccaggcca gatgagagaa 1920 

ccaaggggaa gtgacatagc aggaactact agtacccttc aggaacaaat aggatggatg 1980 

acaaataatc cacctatccc agtaggagaa atatataaga gatggataat cctgggatta 2040 

aataaaatag taagaatgta tagccctacc agcattctgg atataaaaca aggaccaaaa 2:100 

gaacccttta gagattatgt agaccggttc tataaaaccc taagagccga gcaagctaca 2160 

caggaagtaa aaaattggat gacagaaacc ttgttggtcc aaaatgcgaa tccagattgt 2220 

aagactattt taaaagcatt aggaccagca gctacactag aagaaatgat gacagcatgt 2280 

cagggagtgg ggggacccgg ccataaagca agagttttgg ctgaagcaat gagccaagta 2340 

acaggttcag ctgccataat gatgcagaga ggcaatttta ggaaccaaag aaagactgtt 2.400 

aagtgtttca attgtggcaa agaagggcac atagccagaa attgcagggc ccctaggaaa 2460 

aagggctgtt ggaaatgtgg aaaggaagga catcaaatga aggattgcac agaaagacaq 2520 

gctaattag ~ 2529 

<210> 17 
<211> 3195 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef, CT1» and truncated Gaq 

protein (TRN-CTL-dgag) y 
<400> 17 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 18 0 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 



-9- 



WO 02/090558 



PCT/F102/00379 



attcttagca 

gagagactta 

agtcctcaag 

gtgggcaagt 

gagcctgagc 

gcaatcacaa 

gaggaagagg 

aagggagctt 

tactccccaa 

cctgattggc 

tgcttcaagt 

gcgagcctgc 

catctagcat 

gccgtcatca 

gagaaggagg 

accgtgctgg 

tacaccgcct 

atgaccaaga 

cccatcgtgc 

atctacgccg 

gagcccatcg 

ctgtgggtga 

gagcgctacc 

gacatcaacc 

ctggcggccg 

tatcaattaa 

ggcctgttag 

cagacaggat 

catcaaaaga 

aacaacagta 

agccaaaatt 

cctagaactt 

atacccatgt 

aacacagtgg 

gctgcagaat 

agagaaccaa 

tggatgacaa 

ggattaaata 

ccaaaagaac 

gctacacagg 

gattgtaaga 

gcatgtcagg 

caagtaacag 

actgttaagt 

aggaaaaagg 

agacaggcta 



cttttctggg 

ctcttgattg 

tattggtgga 

ggtcaaaatg 

cagcagcaga 

gtagcaatac 

aagtgggttt 

tagatcttag 

aaagacaaga 

agaactacac 

tagtaccagt 

atgggacaga 

ttcatcacaa 

ccctgtggca 

gcaagatcag 

acgtgggcga 

tcaccatccc 

agcagaaccc 

tgcccgagaa 

gcatcaaggt 

tgggcgccga 

ccgtgtacta 

tgcgcgacca 

agatgctgcg 

tgttagacaa 

aacatatagt 

aaacatcaga 

cagaagaact 

tagaggtaaa 

agaaaaaggc 

accctatagt 

taaatgcatg 

tttcagcatt 

ggggacatca 

gggatagatt 

ggggaagtga 

ataatccacc 

aaatagtaag 

cctttagaga 

aagtaaaaaa 

ctattttaaa 

gagtgggggg 

gttcagctgc 

gtttcaattg 

gctgttggaa 

at tag 



acgacctgcg 

tagcgaagat 

atctcctgca 

tagtggatgg 

tggggtggga 

agcaactaat 

tccagtcaga 

ccacttttta 

gatccttgat 

accagggcca 

tgaaccagat 

ggacacggag 

ggcccgagag 

gcgccccctg 

caagatcggc 

cgcctacttc 

cagcatctgg 

cgacatcgtg 

ggacagctgg 

gaagcagctg 

gaccttctac 

cggcgtgccc 

gcagctgctg 

cggccctggc 

atgggaaaaa 

atgggcaagc 

aggctgtaga 

tagatcatta 

agacaccaag 

acagcaagaa 

gcaaaaccta 

ggtaaaagta 

atcagaagga 

agcagccatg 

gcacccagtg 

catagcagga 

tatcccagta 

aatgtatagc 

ttatgtagac 

ttggatgaca 

agcattagga 

acccggccat 

cataatgatg 

tggcaaagaa 

atgtggaaag 



gagcctgtgc 

tgtggaaact 

gtattggagc 

cctactgtaa 

gcagcatctc 

aacgctgctt 

cctcaggtac 

aaagaaaagg 

ctgtgggtct 

ggggtcagat 

gaagaagaga 

agagaagtgt 

ctgcatccgg 

gtggccctga 

cccgccggcc 

agcgtgcccc 

aagggcagcc 

atctaccagt 

ctggtgggca 

atcctgaagg 

gtggacggcg 

gtgtggaagg 

ggcatctggg 

cgcgccttcg 

attcggttaa 

agggagctag 

cagataatgg 

tataatacag 

gaagctttag 

gcagctgacg 

cagggacaaa 

gtggaagaga 

gccaccccac 

caaatgttaa 

catgcagggc 

actactagta 

ggagaaatat 

cctaccagca 

cggttctata 

gaaaccttgt 

ccagcagcta 

aaagcaagag 

cagagaggca 

gggcacatag 

gaaggacatc 



ctcttcagct 

ctgggacgca 

caggaactaa 

gggaaagaat 

gagacctgga 

gtgcctggct 

ctttaagacc 

ggggactgga 

accacacaca 

atccactgac 

acagcagcct 

taaagtggaa 

agtactacaa 

tcgagatctg 

tgaagaagaa 

tggataagga 

ccgccatctt 

acatggacga 

agctgaactg 

agcccgtgca 

ccgccaaccg 

aggccaccac 

gctgcgcctg 

tgaccatccg 

ggccaggggg 

aacgattcgc 

gacagctaca 

tagcaaccct 

acaaggtaga 

caggaaacag 

tggtacatca 

aggctttcag 

aagatttaaa 

aagaaaccat 

ctattgcacc 

cccttcagga 

ataagagatg 

ttctggatat 

aaaccctaag 

tggtccaaaa 

cactagaaga 

ttttggctga 

attttaggaa 

ccagaaattg 

aaatgaagga 



accgccgctt 
gggggtggga 
agaaaagctt 
gaaacaagct 
aaaacatgga 
agaagcacaa 
aatgacttac 
agggttaatt 
aggctacttc 
ctttggatgg 
gttacaccct 
gtttgacagc 
agactgcgcg 
caccgagatg 
gaagagcgtg 
cttccgcaag 
ccagagcagc 
cctgtacgtg 
ggccagccag 
cggcgtgtac 
cgccggcaac 
caccctggtg 
caccccctac 
ccagggcagc 
aaagaaaaaa 
agttaatcct 
accgtccctt 
ctattgtgtg 
ggaagagcaa 
aaaccaggtc 
ggccatatca 
cccagaagta 
caccatgcta 
caatgaggaa 
aggccagatg 
acaaatagga 
gataatcctg 
aaaacaagga 
agccgagcaa 
tgcgaatcca 
aatgatgaca 
agcaatgagc 
ccaaagaaag 
cagggcccct 
ttgcacagaa 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
' 2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3195 



<210> 18 

<211> 3195 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein cds comprised of Rev-Nef-Tat, CTL and truncated Gag 
protein (RNT-CTL-dgag) 

<400> 18 

atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag actcatcaag 60 

tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 

cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 180 

agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc" gcttgagaga 240 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 
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aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 

ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 

accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gfctcgatgcg 1260 

gccgtcatca ccctgtggca gcgccccctg gtggccctga tcgagatctg caccgagatg 1320 

gagaaggagg gcaagatcag caagatcggc cccgccggcc tgaagaagaa gaagagcgtg 1380 

accgtgctgg acgtgggcga cgcctacttc agcgtgcccc tggataagga cttccgcaag 1440 

tacaccgcct tcaccatccc cagcatctgg aagggcagcc ccgccatctt ccagagcagc 1500 

atgaccaaga agcagaaccc cgacatcgtg atctaccagt acatggacga cctgtacgtg 1560 

cccatcgtgc tgcccgagaa ggacagctgg ctggtgggca agctgaactg ggccagccag 1620 

atctacgccg gcatcaaggt gaagcagctg atcctgaagg agcccgtgca cggcgtgtac 1680 

gagcccatcg tgggcgccga gaccttctac gtggacggcg ccgccaaccg cgccggcaac 1740 

ctgtgggtga ccgtgtacta cggcgtgccc gtgtggaagg aggccaccac caccctggtg 1800 

gagcgctacc tgcgcgacca gcagctgctg ggcatctggg gctgcgcctg caccccctac 1860 

gacatcaacc agatgctgcg cggccctggc cgcgccttcg tgaccatccg ccagggcagc 1920 

ctggcggccg tgttagacaa atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa 1980 

tatcaattaa aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct '2040 

ggcctgttag aaacatcaga aggctgtaga cagataatgg gacagctaca accgtccctt 2100 

cagacaggat cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg 2160 

catcaaaaga tagaggtaaa agacaccaag gaagctttag acaaggtaga ggaagagcaa 2220 

aacaacagta agaaaaaggc acagcaagaa gcagctgacg caggaaacag aaaccaggtc 2280 

agccaaaatt accctatagt gcaaaaccta cagggacaaa tggtacatca ggccatatca 2340 

cctagaactt taaatgcatg ggtaaaagta gtggaagaga aggctttcag cccagaagta 2400 

atacccatgt tttcagcatt atcagaagga gccaccccac aagatttaaa caccatgcta 2460 

aacacagtgg ggggacatca agcagccatg caaatgttaa aagaaaccat caatgaggaa 2520 

gctgcagaat gggatagatt gcacccagtg catgcagggc ctattgcacc aggccagatg 2580 

agagaaccaa ggggaagtga catagcagga actactagta cccttcagga acaaatagga 2640 

tggatgacaa ataatccacc tatcccagta ggagaaatat ataagagatg gataatcctg 2700 

ggattaaata aaatagtaag aatgtatagc cctaccagca ttctggatat aaaacaagga 2760 

ccaaaagaac cctttagaga ttatgtagac cggttctata aaaccctaag agccgagcaa 2820 

gctacacagg aagtaaaaaa ttggatgaca gaaaccttgt tggtccaaaa tgcgaatcca 2880 

gattgtaaga ctattttaaa agcattagga ccagcagcta cactagaaga aatgatgaca 2940 

gcatgtcagg gagtgggggg acccggccat aaagcaagag ttttggctga agcaatgagc 3000 

caagtaacag gttcagctgc cataatgatg cagagaggca attttaggaa ccaaagaaag 3060 

actgttaagt gtttcaattg tggcaaagaa gggcacatag ccagaaattg cagggcccct 3120 

aggaaaaagg gctgttggaa atgtggaaag gaaggacatc aaatgaagga ttgcacagaa 3180 

agacaggcta at tag 3195 

<210> 19 
<211> 3195 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef , truncated Gag protein 
and CTL (TRN-dgag-CTL) 

<400> 19 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 
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900 
960 
1020 



ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 7 80 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac Q ™ 
aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 
tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc . 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgcgcg 1260 

gccgtgttag acaaatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatatcaa 1320 

ttaaaacata tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg 1380 

ttagaaacat cagaaggctg tagacagata atgggacagc tacaaccgtc ccttcagaca 1440 

ggatcagaag aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa 1500 

aagatagagg taaaagacac caaggaagct ttagacaagg tagaggaaga gcaaaacaac 1560 

agtaagaaaa aggcacagca agaagcagct gacgcaggaa acagaaacca ggtcagccaa 1620 

aattacccta tagtgcaaaa cctacaggga caaatggtac atcaggccat atcacctaga 1680 

actttaaatg catgggtaaa agtagtggaa gagaaggctt tcagcccaga agtaataccc 1740 

atgttttcag cattatcaga aggagccacc ccacaagatt taaacaccat gctaaacaca 1800 

gtggggggac atcaagcagc catgcaaatg ttaaaagaaa ccatcaatga ggaagctgca 1860 

gaatgggata gattgcaccc agtgcatgca gggcctattg caccaggcca gatgagagaa 1920 

ccaaggggaa gtgacatagc aggaactact agtacccttc aggaacaaat aggatggatg 1980 

acaaataatc cacctatccc agtaggagaa atatataaga gatggataat cctgggatta 2040 

aataaaatag taagaatgta tagccctacc agcattctgg atataaaaca aggaccaaaa 2100 

gaacccttta gagattatgt agaccggttc tataaaaccc taagagccga gcaagctaca 2160 

caggaagtaa aaaattggat gacagaaacc ttgttggtcc aaaatgcgaa tccagattgt 2220 

aagactattt taaaagcatt aggaccagca gctacactag aagaaatgat gacagcatgt 2280 

cagggagtgg ggggacccgg ccataaagca agagttttgg ctgaagcaat gagccaagta 2340 

acaggttcag ctgccataat gatgcagaga ggcaatttta ggaaccaaag aaagactgtt 2400 

aagtgtttca attgtggcaa agaagggcac atagccagaa attgcagggc ccctaggaaa 2460 

aagggctgtt ggaaatgtgg aaaggaagga catcaaatga aggattgcac agaaagacag 2520 

gctaatgcgg ccgtcatcac cctgtggcag cgccccctgg tggccctgat cgagatctgc 2580 

accgagatgg agaaggaggg caagatcagc aagatcggcc ccgccggcct gaagaagaag 2640 

aagagcgtga ccgtgctgga cgtgggcgac gcctacttca gcgtgcccct ggataaggac 2700 

ttccgcaagt acaccgcctt caccatcccc agcatctgga agggcagccc cgccatcttc 2760 

cagagcagca tgaccaagaa gcagaacccc gacatcgtga tctaccagta catggacgac 2820 

ctgtacgtgc ccatcgtgct gcccgagaag gacagctggc tggtgggcaa gctgaactgg 2880 

gccagccaga tctacgccgg catcaaggtg aagcagctga tcctgaagga gcccgtgcac 2940 

ggcgtgtacg agcccatcgt gggcgccgag accttctacg tggacggcgc cgccaaccgc 3000 

gccggcaacc tgtgggtgac cgtgtactac ggcgtgcccg tgtggaagga ggccaccacc 3060 

accctggtgg agcgctacct gcgcgaccag cagctgctgg gcatctgggg ctgcgcctgc 3120 

accccctacg acatcaacca gatgctgcgc ggccctggcc gcgccttcgt gaccatccgc 3180 

cagggcagcc tgtag 3195 

<210> 20 
<211> 3195 
<212> DNA 

<213> Artificial Sequence 



<223> Hybrid protein cds comprised of Rev-Nef-Tat, truncated Gag protein 
and CTL (RNT-dgag-CTL) 

<400> 20 

atggcaggaa gaagcggaga cagcgacgaa gage tec tea agacagtcag actcatcaag 60 
tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 12 0 
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cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 180 

agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 240 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 

aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct. 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc' 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 v 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 
ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc : 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 

accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc . 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgatgcg 1260 

gccgtgttag acaaatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatatcaa 1320 

ttaaaacata tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg 1380 

ttagaaacat cagaaggctg tagacagata atgggacagc tacaaccgtc ccttcagaca 1440 

ggatcagaag aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa 1500 

aagatagagg taaaagacac caaggaagct ttagacaagg tagaggaaga gcaaaacaac 1560 

agtaagaaaa aggcacagca agaagcagct gacgcaggaa acagaaacca ggtcagccaa* 1620 

aattacccta tagtgcaaaa cctacaggga caaatggtac atcaggccat atcacctaga' 1680 

actttaaatg catgggtaaa agtagtggaa gagaaggctt tcagcccaga agtaataccc 1.740 

atgttttcag cattatcaga aggagccacc ccacaagatt taaacaccat gctaaacaca 1800 

gtggggggac atcaagcagc catgcaaatg ttaaaagaaa ccatcaatga ggaagctgca i860 

gaatgggata gattgcaccc agtgcatgca gggcctattg caccaggcca gatgagagaa 1920 

ccaaggggaa gtgacatagc aggaactact agtacccttc aggaacaaat aggatggatg 1980 

acaaataatc cacctatccc agtaggagaa atatataaga gatggataat cctgggatta 2040 

aataaaatag taagaatgta tagccctacc agcattctgg atataaaaca aggaccaaaa 2100 

gaacccttta gagattatgt agaccggttc tataaaaccc taagagccga gcaagctaca 2160 

caggaagtaa aaaattggat gacagaaacc ttgttggtcc aaaatgcgaa tccagattgt 2220 

aagactattt taaaagcatt aggaccagca gctacactag aagaaatgat gacagcatgt 2280 

cagggagtgg ggggacccgg ccataaagca agagttttgg ctgaagcaat gagccaagta 2340 

acaggttcag ctgccataat gatgcagaga ggcaatttta ggaaccaaag aaagactgtt 2400 

aagtgtttca attgtggcaa agaagggcac atagccagaa attgcagggc ccctaggaaa 2460 

aagggctgtt ggaaatgtgg aaaggaagga catcaaatga aggattgcac agaaagacag 2520 

gctaatgcgg ccgtcatcac cctgtggcag cgccccctgg tggccctgat cgagatctgc 2580 

accgagatgg agaaggaggg caagatcagc aagatcggcc ccgccggcct gaagaagaag 2640 

aagagcgtga ccgtgctgga cgtgggcgac gcctacttca gcgtgcccct ggataaggac 2700 

ttccgcaagt acaccgcctt caccatcccc agcatctgga agggcagccc cgccatcttc 2760 

cagagcagca tgaccaagaa gcagaacccc gacatcgtga tctaccagta catggacgac 2820 

ctgtacgtgc ccatcgtgct gcccgagaag gacagctggc tggtgggcaa gctgaactgg 2880 

gccagccaga tctacgccgg catcaaggtg aagcagctga tcctgaagga gcccgtgcac 2940 

ggcgtgtacg agcccatcgt gggcgccgag accttctacg tggacggcgc cgccaaccgc 3000 

gccggcaacc tgtgggtgac cgtgtactac ggcgtgcccg tgtggaagga ggccaccacc 3060 

accctggtgg agcgctacct gcgcgaccag cagctgctgg gcatctgggg ctgcgcctgc 3120 

accccctacg acatcaacca gatgctgcgc ggccctggcc gcgccttcgt gaccatccgc 3180 

cagggcagcc tgtag " " 3195 

<210> 21 
<211> 3020 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef , truncated Gag protein 
and CTL (TRN-optpl7/24-CTL) 

<400> 21 
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atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 

ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tttcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gttggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 78 0 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgcgcg 1260 

gccgtgggcg caagagcctc cgtgctgagc ggcggagagc tggacaagtg ggagaagatc 1320 

cgcctgcgcc ccggcggcaa gaagaagtac cagctgaagc acatcgtgtg ggccagccgc 1380 

gagctggagc gcttcgccgt gaaccccggc ctgctcgaga ccagcgaagg ctgccgccag 1440 

atcatgggcc agctccagcc cagcctccag accggcagcg aggagctgcg cagcctgtac 1500 

aacaccgtgg ccaccctgta ctgcgtgcac cagaagatcg aggtgaagga caccaaggag 1560 

gccctggaca aggtggagga ggagcagaac aacagcaaga agaaggccca gcaggaggcc 1620 

gccgacgccg gcaaccgcaa ccaagtcagc cagaactacc ccatcgtgca gaacctgcag 1680 

ggccagatgg tgcaccaggc catcagcccc cgcaccctga acgcctgggt gaaggtggtg 1740 

gaggagaagg ccttcagccc cgaggtgatc cccatgttca gcgccctaag cgagggcgct 1800 

accccccagg acctgaacac catgctgaac accgtgggcg gccaccaggc cgccatgcag 1860 

atgctgaagg agaccatcaa cgaggaggcc gccgagtggg accgcctgca ccccgtgcac 1920 

gccgggccca tcgcccccgg ccagatgcgc gagccccgcg gcagcgacat cgccggcacc 1980 

accagcaccc tccaggagca gatcggctgg atgaccaaca acccccccat ccccgtgggc 2040 

gagatctaca agcgctggat catcctgggc ctgaacaaga tcgtccgcat gtacagcccc 2100 

accagcatcc tggacatcaa gcagggcccc aaggagccct tccgcgacta cgtggaccgc 2160 

ttctacaaga ccctgcgcgc cgagcaggcc acccaggagg tgaagaactg gatgaccgag 2220 

accctgctgg tgcagaacgc caaccccgac tgcaagacca tcctcaaggc cctgggaccc 2280 

gccgccaccc tggaggagat gatgaccgcc tgccaaggcg tgggcggccc cggccacaag 2340 

gcccgcgtgc tggcggccgt catcaccctg tggcagcgcc ccctggtggc cctgatcgag 2400 

atctgcaccg agatggagaa ggagggcaag atcagcaaga tcggccccgc cggcctgaag 2460 

aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggat 252 0 

aaggacttcc gcaagtacac cgccttcacc atccccagca tctggaaggg cagccccgcc 2580 

atcttccaga gcagcatgac caagaagcag aaccccgaca tcgtgatcta ccagtacatg 2640 

gacgacctgt acgtgcccat cgtgctgccc gagaaggaca gctggctggt gggcaagctg 2700 

aactgggcca gccagatcta cgccggcatc aaggtgaagc agctgatcct gaaggagccc 2760 

gtgcacggcg tgtacgagcc catcgtgggc gccgagacct tctacgtgga cggcgccgcc 2820 

aaccgcgccg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 2880 

accaccaccc tggtggagcg ctacctgcgc gaccagcagc tgctgggcat ctggggctgc 2940 

gcctgcaccc cctacgacat caaccagatg ctgcgcggcc ctggccgcgc ctcgtgacca 3000 

tccgccaggg cagcctgtag 3020 

<210> ' 22 
<211> 3021 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cdscomprised of Tat-Rev-Nef , CTL and truncated Gag 
protein ( TRN- CTL- optpl 7/24 ) 

<400> 22 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaggacc 60 
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ccttgtacca attgctattg taaaaagtgt tgccttcatt gccaagtttg tctcacaaga 120 

aaaggcttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcctcaa 180 

gacagtcaga ctcatcaagt ttctctacca aagcaaccct cctcccagca acgaggggac 240 

ccgacaggcc cgaagaaatc gaagaagaag gtggagagag agacagaggc agatccgttc 300 

gatactagtg caggaagaag cggagacagc gacgaagagc tcctcaagac agtcagactc 360 

atcaagtttc tctaccaaag caaccctcct cccagcaacg aggggacccg acaggcccga 420 

agaaatcgaa gaagaaggtg gagagagaga cagaggcaga tccgttcgat tagtgagcgg 480 

attcttagca cttttctggg acgacctgcg gagcctgtgc ctcttcagct accgccgctt 540 

gagagactta ctcttgattg tagcgaagat tgtggaaact ctgggacgca gggggtggga 600 

agtcctcaag tattggtgga atctcctgca gtattggagc caggaactaa agaaaagctt 660 

gtgggcaagt ggtcaaaatg tagtggatgg cctactgtaa gggaaagaat gaaacaagct 720 

gagcctgagc cagcagcaga tggggtggga gcagcatctc gagacctgga aaaacatgga 780 

gcaatcacaa gtagcaatac agcaactaat aacgctgctt gtgcctggct agaagcacaa 840 

gaggaagagg aagtgggttt tccagtcaga cctcaggtac ctttaagacc aatgacttac 900 

aagggagctt tagatcttag ccacttttta aaagaaaagg ggggactgga agggttaatt 960 

tactccccaa aaagacaaga gatccttgat ctgtgggtct accacacaca aggctacttc 1020 

cctgattggc agaactacac accagggcca ggggtcagat atccactgac ctttggatgg 1080 

tgcttcaagt tagtaccagt tgaaccagat gaagaagaga acagcagcct gttacaccct 1140 

gcgagcctgc atgggacaga ggacacggag agagaagtgt taaagtggaa gtttgacagc 1200 

catctagcat ttcatcacaa ggcccgagag ctgcatccgg agtactacaa agactgcgcg 1260 

gccgtcatca ccctgtggca gcgccccctg gtggccctga tcgagatctg caccgagatg 1320 

gagaaggagg gcaagatcag caagatcggc cccgccggcc tgaagaagaa gaagagcgtg 1380 

accgtgctgg acgtgggcga cgcctacttc agcgtgcccc tggataagga cttccgcaag 1440 

tacaccgcct tcaccatccc cagcatctgg aagggcagcc ccgccatctt ccagagcagc 1500 

atgaccaaga agcagaaccc cgacatcgtg atctaccagt acatggacga cctgtacgtg 1560 

cccatcgtgc tgcccgagaa ggacagctgg ctggtgggca agctgaactg ggccagccag 1620 

atctacgccg gcatcaaggt gaagcagctg atcctgaagg agcccgtgca cggcgtgtac 1680 

gagcccatcg tgggcgccga gaccttctac gtggacggcg ccgccaaccg cgccggcaac 1740 

ctgtgggtga ccgtgtacta cggcgtgccc gtgtggaagg aggccaccac caccctggtg 1800 

gagcgctacc tgcgcgacca gcagctgctg ggcatctggg gctgcgcctg caccccctac 1.860 

gacatcaacc agatgctgcg cggccctggc cgcgccttcg tgaccatccg ccagggcagc 1920 

ctggcggccg tgggcgcaag agcctccgtg ctgagcggcg gagagctgga caagtgggag 1980 
aagatccgcc tgcgccccgg cggcaagaag aagtaccagc tgaagcacat cgtgtgggcc " 2040 

agccgcgagc tggagcgctt cgccgtgaac cccggcctgc tcgagaccag cgaaggctgc 2100 

cgccagatca tgggccagct ccagcccagc ctccagaccg gcagcgagga gctgcgcagc 2-160 ' 

ctgtacaaca ccgtggccac cctgtactgc gtgcaccaga agatcgaggt gaaggacacc 2220 

aaggaggccc tggacaaggt ggaggaggag cagaacaaca gcaagaagaa ggcccagcag 2280 

gaggccgccg acgccggcaa ccgcaaccaa gtcagccaga actaccccat cgtgcagaac 2340 

ctgcagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc ctgggtgaag 24 00 

gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc cctgagcgag 2460 

ggcgctaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca ccaggccgcc 2520 

atgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg cctgcacccc 2580 

gtgcacgccg ggcccatcgc ccccggccag atgcgcgagc cccgcggcag cgacatcgcc 2640 

ggcaccacca gcaccctcca ggagcagatc ggctggatga ccaacaaccc ccccatcccc 2700 

gtgggcgaga tctacaagcg ctggatcatc ctgggcctga acaagatcgt ccgcatgtac 2760 

agccccacca gcatcctgga catcaagcag ggccccaagg agcccttccg cgactacgtg 2820 

gaccgcctct acaagaccct gcgcgccgag caggccaccc aggaggtgaa gaactggatg 2880 

accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct caaggccctg 2940 

ggacccgccg ccaccctgga ggagatgatg accgcctgcc aaggcgtggg cggccccggc 3000 

cacaaggccc gcgtgctgtg a 33 3021 

<210> 23 
<211> 3021 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein cds comprised of Rev-Nef-Tat, CTL and truncated Ga 
protein (RNT-CTI,-optpl7/24) 

<400> 23 

atggcaggaa gaagcggaga cagcgacgaa gagctcctca agacagtcag actcatcaag 60 
tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 



-15- 



WO 02/090558 



PCT/FI02/00379 



cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 180 

agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 240 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 

aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct ; 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 

ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat •; 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc ' 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 
accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac- aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgatgcg 1260 

gccgtcatca ccctgtggca gcgccccctg gtggccctga tcgagatctg caccgagatg 1320 

gagaaggagg gcaagatcag caagatcggc cccgccggcc tgaagaagaa gaagagcgtg 1380 

accgtgctgg acgtgggcga cgcctacttc agcgtgcccc tggataagga cttccgcaag 1440 

tacaccgcct tcaccatccc cagcatctgg aagggcagcc ccgccatctt ccagagcagc 1500 

atgaccaaga agcagaaccc cgacatcgtg atctaccagt acatggacga cctgtacgtg 1560 

cccatcgtgc tgcccgagaa ggacagctgg ctggtgggca agctgaactg ggccagccag,. 1620 

atctacgccg gcatcaaggt gaagcagctg atcctgaagg agcccgtgca cggcgtgtac ' 1680 

gagcccatcg tgggcgccga gaccttctac gtggacggcg ccgccaaccg cgccggcaac 1740 

ctgtgggtga ccgtgtacta cggcgtgccc gtgtggaagg aggccaccac caccctggtg 1800 

gagcgctacc tgcgcgacca gcagctgctg ggcatctggg gctgcgcctg caccccctac 1860 

gacatcaacc agatgctgcg cggccctggc cgcgccttcg tgaccatccg ccagggcagc 1920 

ctggcggccg tgggcgcaag agcctccgtg ctgagcggcg gagagctgga caagtgggag 1980 

aagatccgcc tgcgccccgg. cggcaagaag aagtaccagc tgaagcacat cgtgtgggcc 2040 

agccgcgagc tggagcgctt cgccgtgaac cccggcctgc tcgagaccag cgaaggctgc 2100 

cgccagatca tgggccagct ccagcccagc ctccagaccg gcagcgagga gctgcgcagc 2160 

ctgtacaaca ccgtggccac cctgtactgc gtgcaccaga agatcgaggt gaaggacacc 2220 

aaggaggccc tggacaaggt ggaggaggag cagaacaaca gcaagaagaa ggcccagcag 2280 

gaggccgccg acgccggcaa ccgcaaccaa gtcagccaga actaccccat cgtgcagaac 2340 

ctgcagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc ctgggtgaag 2400 

gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc cctgagcgag 2460 

ggcgctaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca ccaggccgcc 2520 

atgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg cctgcacccc 2580 

gtgcacgccg ggcccatcgc ccccggccag atgcgcgagc cccgcggcag cgacatcgcc 2640 

ggcaccacca gcaccctcca ggagcagatc ggctggatga ccaacaaccc ccccatcccc 2700 

gtgggcgaga tctacaagcg ctggatcatc ctgggcctga acaagatcgt ccgcatgtac 2760 

agccccacca gcatcctgga catcaagcag ggccccaagg agcccttccg cgactacgtg 2820 

gaccgcttct acaagaccct gcgcgccgag caggccaccc aggaggtgaa gaactggatg 2880 

accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct caaggccctg 2940 

ggacccgccg ccaccctgga ggagatgatg accgcctgcc aaggcgtggg cggccccggc 3000 

cacaaggccc gcgtgctgtg a 3021 

<210> 24 
<211> 3021 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Rev-Nef-Tat, truncated Gag protein 
and CTL (RNT-optpl7/24-CTL) 

<400> 24 

atggcaggaa gaagcggaga cagcgacgaa gage tec tea agacagtcag actcatcaag 60 

tttctctacc aaagcaaccc tcctcccagc aacgagggga cccgacaggc ccgaagaaat 120 

cgaagaagaa ggtggagaga gagacagagg cagatccgtt cgattagtga gcggattctt 180 
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agcacttttc tgggacgacc tgcggagcct gtgcctcttc agctaccgcc gcttgagaga 24 u 

cttactcttg attgtagcga agattgtgga aactctggga cgcagggggt gggaagtcct 300 

caagtattgg tggaatctcc tgcagtattg gagccaggaa ctaaagaaac tagtgtgggc 360 

aagtggtcaa aatgtagtgg atggcctact gtaagggaaa gaatgaaaca agctgagcct 420 

gagccagcag cagatggggt gggagcagca tctcgagacc tggaaaaaca tggagcaatc 480 

acaagtagca atacagcaac taataacgct gcttgtgcct ggctagaagc acaagaggaa 540 

gaggaagtgg gttttccagt cagacctcag gtacctttaa gaccaatgac ttacaaggga 600 

gctttagatc ttagccactt tttaaaagaa aaggggggac tggaagggtt aatttactcc 660 

ccaaaaagac aagagatcct tgatctgtgg gtctaccaca cacaaggcta cttccctgat 720 

tggcagaact acacaccagg gccaggggtc agatatccac tgacctttgg atggtgcttc 780 

aagttagtac cagttgaacc agatgaagaa gagaacagca gcctgttaca ccctgcgagc 840 

ctgcatggga cagaggacac ggagagagaa gtgttaaagt ggaagtttga cagccatcta 900 

gcatttcatc acaaggcccg agagctgcat ccggagtact acaaagactg caagcttgag 960 

ccagtagatc ctagactaga gccctggaag catccaggaa gtcagcctag gaccccttgt 1020 

accaattgct attgtaaaaa gtgttgcctt cattgccaag tttgtttcac aagaaaaggc 1080 

ttaggcatct cctatggcag gaagaagcgg agacagcgac gaagagctcc tcaagacagt 1140 

cagactcatc aagtttctct accaaagcaa ccctcctccc agcaacgagg ggacccgaca 1200 

ggcccgaaga aatcgaagaa gaaggtggag agagagacag aggcagatcc gttcgatgcg 1260 

gccgtgggcg caagagcctc cgtgctgagc ggcggagagc tggacaagtg ggagaagatc 1320 

cgcctgcgcc ccggcggcaa gaagaagtac cagctgaagc acatcgtgtg ggccagccgc 1380 

gagctggagc gcttcgccgt gaaccccggc ctgctcgaga ccagcgaagg ctgccgccag 1440 

atcatgggcc agctccagcc cagcctccag accggcagcg aggagctgcg cagcctgtac 1500 

aacaccgtgg ccaccctgta ctgcgtgcac cagaagatcg aggtgaagga caccaaggag 1560 

gccctggaca aggtggagga ggagcagaac aacagcaaga agaaggccca gcaggaggcc 1620 

gccgacgccg gcaaccgcaa ccaagtcagc cagaactacc ccatcgtgca gaacctgcag 1680 

ggccagatgg tgcaccaggc catcagcccc cgcaccctga acgcctgggt gaaggtggtg 1740 

gaggagaagg ccttcagccc cgaggtgatc cccatgttca gcgccctaag cgagggcgct 1800 

accccccagg acctgaacac catgctgaac accgtgggcg gccaccaggc cgccatgcag i860 

atgctgaagg agaccatcaa cgaggaggcc gccgagtggg accgcctgca ccccgtgcac 1920 

gccgggccca tcgcccccgg ccagatgcgc gagccccgcg gcagcgacat cgccggcacc 1980 

accagcaccc tccaggagca gatcggctgg atgaccaaca acccccccat ccccgtgggc 2040 

gagatctaca agcgctggat catcctgggc ctgaacaaga tcgtccgcat gtacagcccc 2100 

accagcatcc tggacatcaa gcagggcccc aaggagccct tccgcgacta cgtggaccgc 2160 

ttctacaaga ccctgcgcgc cgagcaggcc acccaggagg tgaagaactg gatgaccgag 2220 

accctgctgg tgcagaacgc caaccccgac tgcaagacca tcctcaaggc cctgggaccc 2280 

gecgccaccc tggaggagat gatgaccgcc tgccaaggcg tgggcggccc cggccacaag 2-340 

gcccgcgtgc tggcggccgt catcaccctg tggcagcgcc ccctggtggc cctgatcgag 2400 

atctgcaccg agatggagaa ggagggcaag atcagcaaga tcggccccgc cggcctgaag 2460 

aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggat 2520 

aaggacttcc gcaagtacac cgccttcacc atccccagca tctggaaggg cagccccgcc 2580 

atcttccaga gcagcatgac caagaagcag aaccccgaca tcgtgatcta ccagtacatg 2640 

gacgacctgt acgtgcccat cgtgctgccc gagaaggaca gctggctggt gggcaagctg 2700 

aactgggcca gccagatcta cgccggcatc aaggtgaagc agctgatcct gaaggagccc 2760 

gtgcacggcg tgtacgagcc catcgtgggc gccgagacct tctacgtgga cggcgccgcc 2820 

aaccgcgccg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 2880 

accaccaccc tggtggagcg ctacctgcgc gaccagcagc tgctgggcat ctggggctgc 2940- 

gcctgcaccc cctacgacat caaccagatg ctgcgcggcc ctggccgcgc cttcgtgacc 3000 

atccgccagg gcagcctgta g ~ 3 021 

<210> 25 
<211> 419 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Nef-Tat-Rev (NTR) 
<400> 25 

Met Val Gly Lys Trp Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu 

1 5 io 15 

Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala 

20 25 30 

Ala Ser Arg Asp Leu Glu Lys His Gly Ala He Thr Ser Ser Asn Thr 
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35 40 45 

Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu 

50 55 60 

Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr 
65 70 75 80 

Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly 

85 90 95 

Leu Glu Gly Leu He Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu 

100 105 HO 

Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 

115 120 125 

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys 

130 135 140 

Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His 
145 150 155 160 

Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys 

165 170 175 

Trp Lys Phe Asp Ser His Leu Ala Phe His His Lys Ala Arg Glu Leu 

180 185 190 

His Pro Glu Tyr Tyr Lys Asp Cys Thr Ser Ala Gly Arg Ser Gly Asp 

195 200 205 

Ser Asp Glu Glu Leu Leu Lys Thr Val Arg Leu lie Lys Phe Leu Tyr 

210 215 220 

Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg 
225 230 235 240 

Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He 

245 250 255 

Ser Glu Arg He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val 

260 265 270 

Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu 

275 280 285 

Asp Cys Gly Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu 

290 295 300 

Val Glu Ser Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Glu 
305 310 315 - 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 " 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
385 390 395 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 
405 ~ ' 410 415 

Pro Phe Asp 



<210> 26 

<211> 419 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Hybrid protein comprised of Tat-Rev-Nef (TRN) 
<400> 26 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

15 10 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25 ' 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly 
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35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

. 50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

65 90 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu 

100 los no 

Glu Leu Leu Lys Thr Val Arg Leu lie Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg 
^ 130 135 140 ' 

Arg Arg Trp Arg Glu Arg Gin Arg Gin lie Arg Ser lie Ser Glu Arq 
J? 5 150 155 160 

lie Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

165 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 185 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Tro 
o 210 215 ~ 220 ^ 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 ' 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 ■ • 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 
a 290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
5 310 315 320 

Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 3 45 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

3 55 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 
405 4 io 4 1 5 yr 

Lys Asp Cys 

<210> 27 
<211> 419 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Rev-Tat-Nef (RTN) 
<400> 27 

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

1 5 io 15 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 
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35 40 45 

Gin Arg Gin lie Arg Ser He Ser Glu Arg He Leu Ser Thr Phe Leu 

50 55 60 

Gly Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg 
65 70 75 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 HO 

Gly Thr Lys Glu Thr Ser Glu Pro Val Asp Pro Arg Leu Glu Pro Trp 

115 120 125 

Lys His Pro Gly Ser Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys 

130 135 140 

Lys Lys Cys Cys Leu His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu 
145 150 155 160 

Gly He Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro 

165 170 175 

Gin Asp Ser Gin Thr His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser 

180 185 190 

Gin Gin Arg Gly Asp Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val 

195 200 205 

Glu Arg Glu Thr Glu Ala Asp Pro Phe Asp Lys Leu Val Gly Lys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 320 

Tyr Ser Pro Lys Arg Gin Glu lie Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 ~ 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 • 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 ' 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 
405 410 415 

Lys Asp Cys 



<210> 28 

<211> 419 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein comprised of Tat-Nef-Rev (TNR) 
<400> 28 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

1 5 10 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly 
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35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser .Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 95 

Ala Asp Pro Phe Asp Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly 

100 105 110 

Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala 

115 120 125 ■ 

Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala 

130 135 140 

lie Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu 
145 150 155 " 160 

Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val 

165 170 J 175 

Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe 

180 185 190 

Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro Lys Arg 

195 200 205 

Gin Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro 

210 215 220 

Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr 
225 230 235 240 

Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu 

245 250 255 

Asn Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr 

260 265 270 

Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His 

275 280 285 ' 

His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu 

290 295 300 

Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val Arg 
3 °5 310 315 * 320 

Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly 

325 330 335 

Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin 

340 345 350 

Arg Gin He Arg Ser He Ser Glu Arg He Leu Ser Thr Phe Leu Gly 

355 360 365 

Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu 

370 375 380 

Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly Val 
38 5 390 395 400 

Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro Gly 
405 410 415 

Thr Lys Glu 

<210> 29 
<211> 419 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Rev-Nef-Tat <RNT) 
<400> 29 

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

15 10 15 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 



-21- 



WO 02/090558 



PCT/FI02/00379 



35 40 45 

Gin Arg Gin He Arg Ser He Ser Glu Arg lie Leu Ser Thr Phe Leu 

50 55 60 

Gly Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg 
65 70 7 5 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 HO 

Glv Thr Lys Glu Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly Trp 

115 120 125 

Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala 

130 ~ 135 140 

Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala He 
145 150 155 160 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 

165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro Lys Arg Gin 

210 " 215 220 

Glu lie Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
225 " 230 235 240 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr. Glu 

275 280 285 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

290 * 295 300 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 310 315 _ 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 335 

Arq Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
385 390 395 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 
405 410 415 

Pro Phe Asp 

<210> 30 

<211> 387 

<212> PRT 

<213> Artificial Sequence 

<223> Protein comprised of Immunodominant parts of the Nef -Tat-Rev (NTR) 

Met°Gly 3 Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu 

1*5 10 15 

Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His 

20 25 30 

Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala 
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35 40 4b 

Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro 

50 55 60 

Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser 
6 * 70 75 80 

His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro 

85 90 95 

Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr 
100 105 nn 



-luu 105 no 
Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr *Pro 

115 - 120 125 

Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu 

135 140 

Leu Leu His Pro Ala Ser Leu His Gly Thr Glu 

±l±D 150 155 ' 160 

Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala 

165 170 



lbb 170 175 

Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys 

180 185 190 

Ala Leu Ala Ala Val Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys 

195 200 205 

His Pro Gly Ser Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys 

210 215 220 

Lys Cys Cys Leu His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly 
225 230 235 ~ 240 

He Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin 

245 250 ^ 255 

Asp Ser Gin Thr His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin 

260 265 270 

Gin Arg Gly Asp Pro Thr Gly Pro Lys Lys Ser Gly Leu Ala He Leu 

275 280 285 

Leu Ser Asp Glu Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu 

290 295 300 

Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Ara 
l° 5 310 315 320 

Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin lie Arg Ser 

325 330 335 

He Ser Glu Arg He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro 

• 340 345 V 350 

Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser 

355 360 365 

Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val 
370 375 380 



Leu Val Glu 
385 

<210> 31 

<211> 390 

<212> PRT 

<213> Artificial Sequence 
<220> 



<223> Protein comprised of Immunodominant parts of the Nef-Tat-Rev 
separated by protease sites (NTR) 



<400> 31 



Met Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu 

15 10 15 

Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His 

20 25 30 

Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala 

35 40 45 

Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro 
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50 55 60 

Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser 
65 70 75 80 

His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro 

85 90 95 

Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr 

100 105 HO 

Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro 

115 120 125 

Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu- Pro Asp Glu 

130 135 140 

Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu 
145 150 155 160 

Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala 

165 170 175 

Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys 

180 185 190 

Ala Leu Ala Phe Lys Arg Val Glu Pro Val Asp Pro Arg Leu Glu Pro 

195 200 205 

Trp Lys His Pro Gly Ser Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr 

210 215 220 

Cys Lys Lys Cys Cys Leu His Cys Gin Val Cys Phe Thr Arg Lys Gly 
225 230 235 240 

Leu Gly He Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala 

245 250 255 

Pro Gin Asp Ser Gin Thr His Gin Val Ser Leu Pro Lys Gin Pro Ser 

260 265 270 

Ser Gin Gin Arg Gly Asp Pro Thr Gly Pro Lys Lys Ser Val Arg Glu 

275 280 285 

Lys Arg Leu Leu Ser Asp Glu Glu Leu Leu Lys Thr Val Arg Leu He 

290 295 300 

Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly Thr Arg 
305 ' 310 315 320 

Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin 

325 330 335 

He Arg Ser He Ser Glu Arg lie Leu Ser Thr Phe Leu Gly Arg Pro 

340 345 350 

Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu 

355 360 365 

Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly Val Gly Ser 

370 375 380 

Pro Gin Val Leu Val Glu 
385 390 



<210> 32 

<211> 386 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Protein comprised of Immunodominant parts of the regulatory proteins 
Nef-Tat-Rev started from aal of Nef(NHTR) 

<400> 32 

Met Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro 

15 10 15 

Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly 

20 25 30 

Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp 

35 40 45 

Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin 

50 55 60 

Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His 
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65 7 0 75 so 

Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu lie Tyr Ser Pro Lys 

85 90 ~ 95 

Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe 

100 los no 

Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu " 

115 120 125 

Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu 

130 135 140 

Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asd 

it 5 ^ 150 155 i6o • 

Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe 

165 170 i75 

His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Ala 

180 185 190 

Leu Ala Ala Val Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His 

195 200 205 

Pro Gly Ser Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys 
^ 210 215 220 

Cys Cys Leu His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He 
225 230 235 240 

Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp 

245 250 255 

Ser Gin Thr His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin 

260 265 '270 

Arg Gly Asp Pro Thr Gly Pro Lys Lys Ser Gly Leu Ala He Leu Leu ; 

275 280 285 

Ser Asp Glu Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr 

290 295 " 300 y 

Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg 
305 310 315 320 

Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He 

325 330 335 

Ser Glu Arg He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val 

340 345 350 

Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu 

355 360 365 

Asp Cys Gly Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu 

370 375 380 

Val Glu 
385 

<210> 33 
<211> 389 
<212> PRT 

<213> Artificial Sequence 
<220> 

v P »° tei !? com P r i sed of Immunodominant parts of the regulatory proteins 
Nef-Tat-Rev started from aal of Nef separated by protease sites (N11TR) 

<400> 33 

Met Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro 

1 5 io 15 

Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly 

20 25 30 

Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp 

35 40 45 

Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin 

50 55 60 

Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His 
65 70 75 80 

Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu lie Tyr Ser Pro Lys 
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85 90 '95 

Arg Gin Glu lie Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe 

100 105 HO 

Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu 

115 120 125 

Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu ■ 

130 135 140 

Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp 
145 150 155 160 

Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe 

165 170 175 

His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Ala 

180 185 190 

Leu Ala Phe Lys Arg Val Glu Pro Val Asp Pro Arg Leu Glu Pro Trp 

195 200 205 

Lys His Pro Gly Ser Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys 

210 215 220 

Lys Lys Cys Cys Leu His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu 
225 230 235 240 

Gly He Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro 

245 250 255 

Gin Asp Ser Gin Thr His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser 

260 265 270 

Gin Gin Arg Gly Asp Pro Thr Gly Pro Lys Lys Ser Val Arg Glu Lys 

275 280 285 

Arg Leu Leu Ser Asp Glu Glu Leu Leu Lys Thr Val Arg Leu He Lys 

290 295 300 

Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin 
305 310 315 320 

Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin lie 

325 330 335 

Arg Ser He Ser Glu Arg He Leu Ser Thr Phe Leu Gly Arg Pro Ala 

340 345 350 

Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp 

355 360 365 

Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly Val Gly Ser Pro 

370 " 375 380 

Gin Val Leu Val Glu 
385 

<210> 34 
<211> 220 
<212> PRT 

<213> Artificial Sequence 

<220> , _ _ ' j n 

<223> Protein comprised of Cytotoxic T-cell epitopes of Pol and Env 

genes (CTL) 

<400> 34 _ 

Met He Thr Leu Trp Gin Arg Pro Leu Val Ala Leu He Glu He Cys 

15 10 15 

Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys He Gly Pro Ala Gly 

20 25 30 

Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr 

35 40 45 

Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr 

50 55 60 

He Pro Ser He Trp Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met 
65 70 75 80 

Thr Lys Lys Gin Asn Pro Asp He Val He Tyr Gin Tyr Met Asp Asp 

85 90 95 

Leu Tyr Val Pro He Val Leu Pro Glu Lys Asp Ser Trp Leu Val Gly 
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100 105 no 

Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly lie Lys Val Lys Gin 

115 120 125 

Leu He Leu Lys Glu Pro Val His Gly Val Tyr Glu Pro He Val Gly 

!30 135 140 

Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Ala Gly Asn Leu 
i 45 150 155 160 

Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr 

165 170 175 

Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly He Trp 

180 185 iso 

Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin Met Leu Arg Gly Pro 

!95 200 205 

Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser Leu 
21° 215 " 220 

<210> 35 
<2ll> 421 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Truncated Gag protein sequence (dgag) 
<4t)0> 35 

Met Leu Asp Lys Trp Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys 

1 5 io 15 

Lys Tyr Gin Leu Lys His He Val Trp Ala Ser Arg Glu Leu Glu Arg 

20 25 30 

Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin 

35 40 45 

He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu 

. 50 55 60 

Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys 
65 70 75 80 

He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu 

85 90 95 

Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly 

100 105 no 

Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin 

115 120 125 

Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Tro 

I 30 135 140 

Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro Met 

H 5 150 155 160 

Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met 

165 170 175 

Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu 

180 185 190 

Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His Pro Val His 

I 95 200 205 

Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asd 

210 215 ~~ 220 

He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr 
l 2S 2 30 235 ' 240 

Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He .He 

245 250 * 255 

Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu 

260 265 270 

Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg 

275 280 285 

Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn 
290 295 300 
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Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys 
305 310 315 320 

Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met 

325 330 335 

Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 

340 345 350 

Ala Glu Ala Met Ser Gin Val Thr Gly Ser Ala Ala He Met Met Gin 

355 360 365 

Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys Cys Phe Asn Cys 

370 375 380 

Gly Lys Glu Gly His He Ala Arg Asn Cys Arg Ala Pro Arg Lys Lys 
385 390 395 400 

Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin Met Lys Asp Cys Thr 

405 410 415 

Glu Arg Gin Ala Asn 
420 

<210> 36 
<2ll> 363 
<212> PRT 

<213> Artificial Sequence 

<220> , . 

<223> Synthetic pl7/24 protein of Gag gene(Syn 17/24) 

<400> 36 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 

15 io 15 

Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Gin Leu Lys 

20 25 30 

His He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 

35 40 45 

Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin lie Met Gly Gin Leu 

50 55 60 

Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 75 80 

Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys He Glu Val Lys Asp 

85 90 95 

Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu Gin Asn Asn Ser Lys 

100 ' 105 HO 

Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly Asn Arg Asn Gin Val 

115 120 125 

Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin Met Val His 

130 135 140 

Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 160 

Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 

165 170 175 

Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 

180 185 190 

Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 

195 200 205 

Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro He Ala 

210 215 220 

Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr 
225 230 235 240 

Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He 

245 250 255 

Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys 

260 265 270 

He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Lys Gin Gly 

275 280 285 

Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
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290 




Arg Ala Glu 


Gin 


305 




Leu Leu Val 


Gin 


Leu Gly Pro 


Ala 




340 


Val Gly Gly 


Pro 


355 





295 300 
Ala Thr Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
310 315 320 

Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Lys Ala 
325 330 335 

Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 

345 350 
Gly His Lys Ala Arg Val Leu 
360 

<210> 37 

<211> 363 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic pl7/24 protein of Gag gene optimized for expression in 
eukaryotic cells (optp 17/24) 

<400> 37 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 

1 5 io 15 

Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Gin Leu Lys 

20 25 30 

His He Val, Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 

35 40 45 

Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Met Gly Gin Leu 

50 55 60 

Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 75 80 

Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys He Glu Val Lys Asp 

85 90 95 

Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu Gin Asn Asn Ser Lys 

100 105 no 

Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly Asn Arg Asn Gin Val 

115 120 125 

Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin Met Val His 

130 135 140 

Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 * 160 

Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 

165 170 175 

Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 

180 185 190 

Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 

195 200 205 

Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro He Ala 

210 215 220 

Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp lie Ala Gly Thr Thr 
225 230 235 " 240 

Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He 

245 250 255 

Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys 

260 265 270 

He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Lys Gin Gly 

275 280 285 

Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 

290 295 300 

Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
305 310 3i5 

Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala 

325 330 335 

Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 
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340 345 350 

Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 
355 360 

<210> 38 
<211> 641 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Tat-Rev-Nef and CTL (TRN-CTL) 
<400> 38 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

15 10 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly 

35 ' 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu 

x 100 105 HO 

Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg 

130 135 140 

Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He Ser Glu Arg 
145 150 155 160 

He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

165 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 185 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 ~ * 315 320 

Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 ~ 390 395 400 
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His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 

405 410 415 

Lys Asp Cys Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 • 430 

Leu He Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys 

435 440 445 

He Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 460 

Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 
465 470 475 480 

Tyr Thr Ala Phe Thr He Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 490 " 495 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr 

500 505 " 510 

Gin Tyr Met Asp Asp Leu Tyr Val Pro He Val Leu Pro Glu Lys Asd 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Ala Gly 

n 530 535 5 40 

He Lys Val Lys Gin Leu He Leu Lys Glu Pro Val His Gly Val Tyr 
545 550 555 560 

Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 590 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

5g 5 600 605 

Leu Leu Gly He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin 

610 615 620 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser 

III 



<210> 39 
<211> 641 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Rev-Nef-Tat and CTL (RNT-CTL) 
<400> 39 

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

1 5 10 15 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 

35 40 45 

Gin Arg Gin He Arg Ser lie Ser Glu Arg He Leu Ser Thr Phe Leu 

50 55 60 

Gly Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arq 
f 5 u 70 75 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 no 

Gly Thr Lys Glu Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly Trp 

115 120 125 

Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala 

130 135 140 

Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala He 

it 5 150 155 160 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 
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165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 J 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro Lys Arg Gin 

210 215 220 

Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
225 230 235 240 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu 

275 280 285 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

290 295 300 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 ~ 310 315 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
385 390 395 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 

405 410 415 

Pro Phe Asp Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 430 

Leu He Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys 

435 440 445 

He Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 460 

Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 
465 470 475 480 

Tyr Thr Ala Phe Thr He Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 490 495 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr 

500 505 510 

Gin Tyr Met Asp Asp Leu Tyr Val Pro lie Val Leu Pro Glu Lys Asp 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly 

530 535 540 

lie Lys Val Lys Gin Leu lie Leu Lys Glu Pro Val His Gly Val Tyr 
545 550 555 560 

Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 590 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

595 600 60S 

Leu Leu Gly He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin 

610 ~ 615 620 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser 
625 ~ 630 635 640 

Leu 

<210> 40 
<211> 842 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef and truncated Gaq 
protein (TRN-dgag) 

<400> 40 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

1 5 10 ' 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 

35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 ~ 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu 

100 105 no 

Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arq Arq 

130. 135 " 140 

Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He Ser Glu Arg 
l « 150 155 160 

He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

165 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 185 * 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Tro 

210 215 * 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 * 320 

Tyr Ser Pro Lys Arg Gin Glu lie Leu Asp Leu Trp Val Tyr His Thr 

325 330 * 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp. Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 

405 410 415 

Lys Asp Cys Ala Ala Val Leu Asp Lys Trp Glu Lys He Arg Leu Arg 

420 425 430 

Pro Gly Gly Lys Lys Lys Tyr Gin Leu Lys His lie Val Trp Ala Ser 



-33- 



WO 02/090558 



PCT/FI02/00379 



435 440 44b 

Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser 

450 455 460 

Glu Gly Cys Arg Gin lie Met Gly Gin Leu Gin Pro Ser Leu Gin Thr 
465 ~ 470 475 480 

Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr 

485 490 495 

Cys Val His Gin Lys He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp 

500 505 510 

Lys Val Glu Glu Glu Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu 

515 520 525 

Ala Ala Asp Ala Gly Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He 

530 535 540 

Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
545 550 555 560 

Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 

565 570 575 

Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 

580 585 590 

Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 

595 600 605 

Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg 

610 615 620 

Leu His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu 
625 630 635 640 

Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 

645 650 655 

He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr 

660 665 670 

Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser 

675 680 685 

Pro Thr Ser He Leu Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg 

690 ~ 695 700 

Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr 
705 710 715 720 

Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala 

725 730 735 

Asn Pro Asp Cys Lys Thr lie Leu Lys Ala Leu Gly Pro Ala Ala Thr 

740 745 750 

Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 

755 . 760 765 

Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Gly Ser Ala 

770 775 780 

Ala He Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val 
785 790 795 800 

Lys Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn Cys Arg 

805 810 815 

Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin 

820 825 830 

Met Lys Asp Cys Thr Glu Arg Gin Ala Asn 
835 840 



<210> 41 

<211> 1064 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef , CTL and truncated Gag 
protein(TRN-TCL-dgag) 

<400> 41 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
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1 5 10 i b 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 

35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu 

100 105 no 

Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg 

130 135 140 

Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He Ser Glu Arg 
i45 150 155 160 

He Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

165 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 185 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 . 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 320 

Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 

405 410 415 

Lys Asp Cys Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 " 430 

Leu He Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys 

435 440 445 

He Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 ' 46O 

Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 
465 470 475 4^0 

Tyr Thr Ala Phe Thr He Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 490 " 495 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr 
500 505 ^ 510 
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Gin Tyr Met Asp Asp Leu Tyr Val Pro lie Val Leu Pro G±u Lys Asp 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Ala Gly 

530 535 540 

He Lys Val Lys Gin Leu He Leu Lys Glu Pro Val His Gly Val Tyr 
545 * 550 555 560 

Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 590 ' 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

595 600 605 

Leu Leu Gly He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin 

610 615 620 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser 
625 ' ' 630 635 640 

Leu Ala Ala Val Leu Asp Lys Trp Glu Lys He Arg Leu Arg Pro Gly 

645 650 655 

Gly Lys Lys Lys Tyr Gin Leu Lys His He Val Trp Ala Ser Arg Glu 

660 665 670 

Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly 

675 680 685 

Cys Arg Gin He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser 

690 695 700 

Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val 
705 " 710 715 720 

His Gin Lys He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val 

725 730 735 

Glu Glu Glu Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala 

740 745 750 

Asp Ala Gly Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin 

755 ~ 760 765 

Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu 

770 775 780 

Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val 
785 " 790 795 800 

He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu 

805 810 815 

Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met 

820 825 830 

Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His 

835 840 845 

Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg 

850 855 860 

Gly Ser Asp lie Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly 
865 870 875 880 

Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg 

885 890 895 

Trp lie lie Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr 

900 905 910 

Ser He Leu Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr 
915 920 925 

• Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu 
930 935 940 

Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro 
945 ' 950 955 960 

Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu 

965 970 975 

Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala 

980 985 990 

Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Gly Ser Ala Ala He 

995 1000 1005 

Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys 
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1010 1015 loau 

Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn Cys Arq 

1025 1030 1035 

Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His 

1040 1045 " 1050 

Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn 

1055 1060 

<210> 42 

<211> 1064 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Tat-Rev-Nef , CTL and truncated Gaq 
protein (TRN-CTL-dgag) 

<400> 42 

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

15 10 is 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 

35 40 45 

Gin Arg Gin He Arg Ser He Ser Glu Arg He Leu Ser Thr Phe Leu 

50 55 60 

Gly Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arq 
65 70 75 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 no 

Gly Thr Lys Glu Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly Trp 

US 120 125 

Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala 

130 135 140 

Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala He 

it 5 150 I 55 160 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 

165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 " 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro Lys Arq Gin 

210 215 220 

Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
225 230 235 2 40 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 " 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu 

275 280 285 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

290 295 3 0 o 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 310 315 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 
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355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 ~ 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 

^fttr 390 395 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 

405 410 
Pro Phe Asp Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 
Leu lie Glu lie Cys Thr Glu Met Glu Lys Glu Gly Lys lie Ser Lys 

435 440 445 

lie Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 4 60 

val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 

j <-c 470 475 

Tyr Thr Ala Phe Thr lie Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 490 49 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr 

500 505 5 

Gin Tyr Met Asp Asp Leu Tyr Val Pro He Val Leu Pro Glu Lys Asp 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly 

5-30 535 540 

lie Lys val Lys Gin Leu He Leu Lys Glu Pro Val His Gly Val Tyr 

545 * 55° 555 

Glu Pro lie val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 590 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

595 600 505 

Leu Leu Gly He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser 

£?5 630 635 6 * u 

Leu Ala Ala Val Leu Asp Lys Trp Glu Lys lie Arg Leu Arg Pro Gly 

645 650 655 

Gly Lys Lys Lys Tyr Gin Leu Lys His He Val Trp Ala Ser Arg Glu 

660 66 5 670 

Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly 

675 680 685 

Cys Arg Gin He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser 

690 695 700 

Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val 

705 7 1° 715 Z i 

His Gin Lys He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val 

725 730 
Glu Glu Glu Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala 

740 74 5 750 , _ 

Asp Ala Gly Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin 

755 7 60 765 

Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu 

770 77 5 780 

Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val 

785 7 9° 9 

lie Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu 
805 810 8 

Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met 

820 825 830 

Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His 

835 84 ° 845 

Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg 
850 655 860 
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865 



915 

Val Asp Arg 
930 

Val Lys Asn 
945 



He 


Ala 


Gly 
870 


Thr 


Thr Ser 


Thr 


Leu 
875 


Gin 


Glu 


Gin 


He 


Gly 
880 


Asn 


Asn 


Pro 


Pro 


He Pro 


Val 


Gly 


Glu 


He 


Tyr 


Lys 


Arg 




885 








890 










895 


Leu 


Gly 


Leu 


Asn 


Lys He 


Val 


Arg 


Met 


Tyr 


Ser 


Pro 


Thr 


900 








905 








910 






Asp 


He 


Lys 


Gin 


Gly Pro 


Lys 


Glu 


Pro 


Phe 


Arg 


Asp 


Tyr 


Phe 








920 








925 




Tyr 


Lys 


Thr 
935 


Leu Arg 


Ala 


Glu 


Gin 
940 


Ala 


Thr 


Gin 


Glu 


Trp 


Met 


Thr 


Glu 


Thr Leu 


Leu 


Val 


Gin 


Asn 


Ala 


Asn 


Pro 


Thr 




950 








955 










960 


He 
965 


Leu 


Lys 


Ala Leu 


Gly 
970 


Pro 


Ala 


Ala 


Thr 


Leu 
975 


Glu 


Thr 


Ala 


Cys 


Gin 


Gly Val 


Gly 


Gly 


Pro 


Gly 


His 


Lys 


Ala 


980 








985 








990 




Ala 


Glu 


Ala 


Met 


Ser Gin Val 


. Thr Gly Ser Ala Ala He 



"5 1000 1005 

Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys 

iOlO 1015 1020 

Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn Cys Arq 

1025 1030 1035 

Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His 

1040 1045 1050 

Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn 

1055 1060 

<210> 43 

<211> 1064 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Tat-Rev-Nef , truncated Gag protein and 
CTL (TRN-dgag-CTL) 



10 15 
Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

25 30 
Arg Lys Gly Leu Gly lie Ser Tyr Gly 
40 45 
Arg Arg Ala Pro Gin Asp Ser Gin Thr 
60 

Gin Pro Ser Ser Gin Gin Arg Gly Asp 

75 80 
Lys Lys Lys Val Glu Arg Glu Thr Glu 

90 95 
Ala Gly Arg Ser Gly Asp Ser Asp Glu 

105 no 
Leu He Lys Phe Leu Tyr Gin Ser Asn 
120 125 
Thr Arg Gin Ala Arg Arg Asn Arg Arg 
140 

Arg Gin He Arg Ser He Ser Glu Arg 
155 160 
Arg Pro Ala Glu Pro Val Pro Leu Gin 

170 175 
Thr Leu Asp Cys Ser Glu Asp Cys Gly 

185 190 
Gly Ser Pro Gin Val Leu Val Glu Ser 
195 200 205 



<400> ' 


43 








Met 


Glu 


Pro 


Val 


Asp 


Pro Arg 


1 








5 


Gin 


Pro 


Arg 


Thr 


Pro 


Cys Thr 








20 






His 


Cys 


Gin 


Val 


Cys 


Phe Thr 






35 








Arg 


Lys 


Lys 


Arg 


Arg 


Gin Arg 




50 








55 


His 


Gin 


Val 


Ser 


Leu 


Pro Lys 


65 










70 


Pro 


Thr 


Gly 


Pro 


Lys 


Lys Ser 










85 




Ala 


Asp 


Pro 


Phe 


Asp 


Thr Ser 








100 






Glu 


Leu 


Leu 


Lys 


Thr 


Val Arg 






115 






Pro 


Pro 


Pro 


Ser 


Asn 


Glu Gly 




130 








135 


Arg 


Arg 


Trp 


Arg 


Glu 


Arg Gin 


145 










150 


He 


Leu 


Ser 


Thr 


Phe 


Leu Gly 










165 


Leu 


Pro 


Pro 


Leu 


Glu 


Arg Leu 








180 




Asn 


Ser 


Gly 


Thr 


Gin 


Gly Val 
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Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu vai uxy uys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 320 

Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu- 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 ~ 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 

405 410 415 

Lys Asp Cys Ala Ala Val Leu Asp Lys Trp Glu Lys He Arg Leu Arg 

420 425 430 

Pro Gly Gly Lys Lys Lys Tyr Gin Leu Lys His He Val Trp Ala Ser 

435 440 445 

Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser 

450 455 460 

Glu Gly Cys Arg Gin He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr 
465 470 475 480 

Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr 

485 490 495 

Cys Val His Gln-Lys He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp 

500 505 510 

Lys Val Glu Glu Glu Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu 

515 520 525 

Ala Ala Asp Ala Gly Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He 

530 ^ 535 540 

Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
545 550 555 560 

Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 

565 570 575 

Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 

580 585 590 

Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 

595 600 60S 

Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg 

610 615 620 

Leu His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu 
625 630 635 640 

Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 

645 650 655 

He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr 

660 665 670 

Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser 

675 680 685 

Pro Thr Ser He Leu Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg 

690 695 700 

Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr 
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705 710 715 720 

Gin Glu Val Lys Asn Tip Met Thr Glu Thr Leu Leu Val Gin Asn Ala 

725 730 735 

Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr 

740 745 750 

Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 

n 755 760 765 

Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Gly Ser Ala 

770 775 780 

Ala He Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val 
785 790 795 800 

Lys Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn Cys Arg 

805 810 ^ 815 

Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin 

820 825 830 

Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Ala Ala Val He Thr Leu 

835 840 845 

Trp Gin Arg Pro Leu Val Ala Leu He Glu He Cys Thr Glu Met Glu 

850 855 860 

Lys Glu Gly Lys He Ser Lys He Gly Pro Ala Gly Leu Lys Lys Lys 
865 870 875 880 

Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro 

885 890 895 

Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro Ser He 

500 905 910 

Trp Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys Lys Gin 

915 920 925 

Asn Pro Asp He Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Pro 

930 935 940 

He Val Leu Pro Glu Lys Asp Ser Trp Leu Val Gly Lys Leu Asn Trp 
9 ? 5 _ 950 955 960 

Ala Ser Gin He Tyr Ala Gly He Lys Val Lys Gin Leu He Leu Lys 

965 970 975 

Glu Pro Val His Gly Val Tyr Glu Pro He Val Gly Ala Glu Thr Phe 

980 985 ' 990 

Tyr Val Asp Gly Ala Ala Asn Arg Ala Gly Asn Leu Trp Val Thr Val 

"5 1000 1005 

Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Val 

1010 1015 1020 

Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly He Trp Gly Cys 

1025 1030 1035 

Ala Cys Thr Pro Tyr Asp He Asn Gin Met Leu Arg Gly Pro Gly 

1040 1045 1050 

Arg Ala Phe Val Thr He Arg Gin Gly Ser Leu 
1055 1060 

<210> 44 

<211> 1064 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Hybrid protein comprised of Rev-Nef-Tat, truncated Gag protein and 
CTL (RNT-dgag-CTL) 

<400> 44 

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

15 10 15 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 

35 40 45 

Gin Arg Gin He Arg Ser lie Ser Glu Arg He Leu Ser Thr Phe Leu 
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50 55 60 

Gly Arg Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg 
65 70 . 75 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 HO 

Gly Thr Lys Glu Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly Trp 

115 120 125 

Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala 

130 135 140 

Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala lie 
145 150 155 160 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 

165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu lie Tyr Ser Pro Lys Arg Gin 

210 215 220 

Glu lie Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
225 230 235 240 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu 

275 280 285 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

290 295 300 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 310 315 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
385 390 395 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 

405 410 415 

Pro Phe Asp Ala Ala Val Leu Asp Lys Trp Glu Lys He Arg Leu Arg 

420 425 430 

Pro Gly Gly Lys Lys Lys Tyr Gin Leu Lys His He Val Trp Ala Ser 

435 440 445 

Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser 

450 455 460 

Glu Gly Cys Arg Gin He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr 
465 470 475 480 

Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr 

485 490 * 495 

Cys Val His Gin Lys He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp 

500 505 510 

Lys Val Glu Glu Glu Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu 

515 520 525 

Ala Ala Asp Ala Gly Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He 

530 535 540 

Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
545 550 555 560 
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Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 

565 570 575 

Glu val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 

580 585 590 

Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 

595 600 60S 

Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg 

610 615 620 

Leu His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu 

t * , 630 635 640 

Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 

645 650 655 

He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr 

660 665 670 

Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser 

675 680 * 685 

Pro Thr Ser lie Leu Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arq 

690 695 700 

Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr 
705 710 715 720 

Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala 

725 730 735 

Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr 

740 745 ?50 

Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 

7 55 760 765 

Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Gly Ser Ala 

770 775 780 

Ala He Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val 
790 795 " 800 

Lys Cys Phe Asn Cys Gly Lys Glu Gly His He Ala Arg Asn Cys Arg 

. 805 810 815 

Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin 

820 825 830 

Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Ala Ala Val He Thr Leu 

835 840 845 

Trp Gin Arg Pro Leu Val Ala Leu He Glu He Cys Thr Glu Met Glu 

850 855 860 

Lys Glu Gly Lys He Ser Lys He Gly Pro Ala Gly Leu Lys Lys Lys 
865 870 875 880 

Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro 

885 890 895 

Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro Ser He 

9 00 905 910 

Trp Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys Lys Gin 

915 920 925 

Asn Pro Asp lie Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Pro 

930 935 940 

lie Val Leu Pro Glu Lys Asp Ser Trp Leu Val Gly Lys Leu Asn Trp 
_ 950 9 55 960 

Ala Ser Gin He Tyr Ala Gly He Lys Val Lys Gin Leu He Leu Lys 

9 *5 970 975 

Glu Pro Val His Gly Val Tyr Glu Pro He Val Gly Ala Glu Thr Phe 

98 0 985 990 

Tyr Val Asp Gly Ala Ala Asn Arg Ala Gly Asn Leu Trp Val Thr Val 
_ 995 1000 1005 

Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Val 

, 1010 1015 1020 

Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly He Trp Gly Cys 

1025 1030 1035 

Ala Cys Thr Pro Tyr Asp He Asn Gin Met Leu Arg Gly Pro Gly 

1040 104 5 1050 

Arg Ala Phe Val Thr He Arg Gin Gly Ser Leu 
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1055 1060 

<210> 45 

<211> 1006 

<212> PRT 

<213'> Artificial Sequence 
<220> 

<223> Hybrid protein cds comprised of Tat-Rev-Nef , truncated Gag protein 
and CTL{TRN-optpl7/24-CTL ) 

<400> 45 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys HiB Pro Gly Ser 

15 10 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu 

20 25. 30 

His CyB Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly 

35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 " 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu- 

100 105 HO 

Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg 

130 135 140 

Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He Ser Glu Arg 
145 150 155 160 

lie Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

165 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 185 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 320 

Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Thr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 
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405 410 415 

Lys Asp Cys Ala Ala Val Gly Ala Arg Ala Ser Val Leu Ser Gly Gly 

420 425 430 

Glu Leu Asp Lys Trp Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys 

435 440 445 

Lys Tyr Gin Leu Lys His He Val Trp Ala Ser Arg Glu Leu Glu Arq 

450 455 460 

Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin 
4 f 5 470 475 ~ 480 

He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu 

485 490 495 

Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys 

500 505 510 

He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu 

515 520 525 

Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly 

530 535 540 

Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin 
%* S 55 0 555 560 

Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp 

565 570 " 575 

Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro Met 

580 585 590 

Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met 

595 600 605 

Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu 

610 615 620 

Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His Pro Val His 
625 630 635 640 

Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp 

64 5 650 655 

He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr 

660 665 670 

Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He lie 

675 680 685 

Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu 

690 695 700 

Asp He Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arq 
™ 5 710 715 725 

Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn 

725 730 735 

Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys 

740 745 750 

Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met 

755 760 765 

Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 

, 770 775 780 

Ala Ala Val lie Thr Leu Trp Gin Arg Pro Leu Val Ala Leu lie Glu 
785 790 795 800 

He Cys Thr Glu Met Glu Lys Glu Gly Lys lie Ser Lys He Gly Pro 

805 810 815 

Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp 
_ - 820 825 8 30 

Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala 

835 840 ~ 845 

Phe Thr He Pro Ser He Trp Lys Gly Ser Pro Ala He Phe Gin Ser 

850 855 860 

Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr Gin Tyr Met 
865 870 8 75 ~ 880 

Asp Asp Leu Tyr Val Pro lie Val Leu Pro Glu Lys Asp Ser Trp Leu 

885 8 9o * 89 5 

Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly lie Lys Val 
900 905 " 910 
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Lys Gin Leu lie Leu Lys Glu Pro Val His Gly Val Tyr Glu Pro lie 

1 915 920 925 

Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Ala Gly 

930 935 940 

Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
945 ~ 950 955 9*>0 

Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly 

965 970 975 

He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin Met Leu Arg 

980 985 990 

Gly pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser Leu 
995 1000 1005 

<210> 46 
<211> 1006 
<212> PRT 

<213> Artificial Sequence 

<223> Hybrid protein cdscomprised of Tat-Rev-Nef , CTL and truncated Gag 
protein (TRN-CTL-optpl7/24) 

Met°Glu 4 pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

1 5 io 15 

Gin Pro Arg Thr Pro Cys Thr Asn Cys.Tyr Cys Lys Lys Cys Cys Leu 

20 25 30 

His Cys Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly 

35 40 4 5 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 95 

Ala Asp Pro Phe Asp Thr Ser Ala Gly Arg Ser Gly Asp Ser Asp Glu 

100 105 HO 

Glu Leu Leu Lys Thr Val Arg Leu He Lys Phe Leu Tyr Gin Ser Asn 

115 120 125 

Pro Pro Pro Ser Asn Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg 

130 135 140 

Arg Arg Trp Arg Glu Arg Gin Arg Gin He Arg Ser He Ser Glu Arg 
145 150 155 160 

lie Leu Ser Thr Phe Leu Gly Arg Pro Ala Glu Pro Val Pro Leu Gin 

1 65 170 175 

Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly 

180 " 185 190 

Asn Ser Gly Thr Gin Gly Val Gly Ser Pro Gin Val Leu Val Glu Ser 

195 200 205 

Pro Ala Val Leu Glu Pro Gly Thr Lys Glu Lys Leu Val Gly Lys Trp 

210 215 220 

Ser Lys Cys Ser Gly Trp Pro Thr Val Arg Glu Arg Met Lys Gin Ala 
225 230 235 240 

Glu Pro Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg Asp Leu 

245 250 255 

Glu Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala 

260 265 ' 270 

Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu Glu Val Gly Phe Pro 

275 280 285 

Val Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu 

290 295 300 

Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He 
305 310 315 3.0 
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Tyr Ser Pro Lys Arg Gin Glu He Leu Asp Leu Trp Val Tyr His Tnr 

325 330 335 

Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val 

340 345 350 

Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu 

355 360 " 365 

Pro Asp Glu Glu Glu Asn Ser Ser Leu Leu His Pro Ala Ser Leu His 

370 375 380 

Gly Thr Glu Asp Thr Glu Arg Glu Val Leu Lys Trp Lys Phe Asp Ser 
385 390 395 400 

His Leu Ala Phe His His Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr 

405 410 415 

Lys Asp Cys Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 ~* 430 

Leu He Glu lie Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys 

435 440 445 

He Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 460 

Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 
465 470 475 480 

Tyr Thr Ala Phe Thr He Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 490 ~ 495 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp lie Val He Tyr 

500 505 510 

Gin Tyr Met Asp Asp Leu Tyr Val Pro lie Val Leu Pro Glu Lys Asp 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly 

530 535 540 ■ 

He Lys Val Lys Gin Leu He Leu Lys Glu Pro Val His Gly Val Tyr 
545 550 555 560 

Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 * 590 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

595 600 605 

Leu Leu Gly lie Trp Gly Cys Ala Cys Thr Pro Tyr Asp lie Asn Gin 

610 615 620 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr lie Arg Gin Gly Ser 
625 630 635 640 

Leu Ala Ala Val Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu 

645 650 655 

Asp Lys Trp Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr 

660 665 670 

Gin Leu Lys His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala 

675 680 685 

Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Met 

690 695 700 

Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arq Ser 
705 710 7i5 * 720 

Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys He Glu 

725 730 735 

Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu Gin Asn 

740 745 750 

Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly Asn Arq 

755 760 765 

Asn Gin Val Ser Gin Asn Tyr Pro lie Val Gin Asn Leu Gin Gly Gin 

770 775 780 

Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys 
785 790 " 795 800 

Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val lie Pro Met Phe Ser 

805 810 815 

Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn 



-47- 



WO 02/090558 



PCT/FI02/00379 



820 825 830 

Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr lie 

835 840 845 

Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly 

850 855 860 

Pro lie Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala 
865 870 • 875 880 

Glv Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn 

885 890 895 

Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly 

900 ~ 905 910 

Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He 

915 920 925 

Lvs Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr 

y 930 935 940 

Lvs Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn Trp Met 
945 950 955 960 

Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He 

965 970 975 

Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala 

980 985 990 

Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 
995 1000 1005 

<210> 47 
<211> 1006 
<212> PRT 

<213> Artificial Sequence 

<223> Hybrid protein comprised of Rev-Nef-Tat, CTL and truncated Gag 
protein (RNT-CTL-optpl7/24) 

Met°Ala 4 Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Leu Lys Thr Val 

1 5 10 15 

Arg Leu He Lys Phe Leu Tyr Gin Ser Asn Pro Pro Pro Ser Asn Glu 

20 25 30 

Gly Thr Arg Gin Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 

35 40 45 

Gin Arg Gin He Arg Ser He Ser Glu Arg lie Leu Ser Thr Phe Leu 

50 55 60 

Glv Arq Pro Ala Glu Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg 
65 ■ 70 75 80 

Leu Thr Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gin Gly 

85 90 95 

Val Gly Ser Pro Gin Val Leu Val Glu Ser Pro Ala Val Leu Glu Pro 

100 105 HO 

Gly Thr Lys Glu Thr Ser Val Gly Lys Trp Ser Lys Cys Ser Gly Trp 

115 • 120 125 

Pro Thr Val Arg Glu Arg Met Lys Gin Ala Glu Pro Glu Pro Ala Ala 

130 " 135 140 

Asn Glv Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala He 
14 5 150 ~ 155 160 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 

165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu He Tyr Ser Pro Lys Arg Gin 

210 " 215 220 

Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
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225 230 235 2 4U 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu 

275 280 265 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

250 295 300 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 310 315 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 " 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
385 390 395 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 

405 410 415 

Pro Phe Asp Ala Ala Val lie Thr Leu Trp Gin Arg Pro Leu Val Ala 

420 425 430 

Leu He Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys 

435 440 445 

He Gly Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 

450 455 46Q 

Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys 
465 470 475 480 

Tyr Thr Ala Phe Thr lie Pro Ser He Trp Lys Gly Ser Pro Ala He 

485 - 490 495 

Phe Gin Ser Ser Met Thr Lys Lys Gin Asn Pro Asp He Val lie Tyr 

500 505 510 

Gin Tyr Met Asp Asp Leu Tyr Val Pro lie Val Leu Pro Glu Lys Asp 

515 520 525 

Ser Trp Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Ala Gly 

530 535 " 540 

He Lys Val Lys Gin Leu He Leu Lys Glu Pro Val His Gly Val Tyr 
545 550 555 * 5 60 

Glu Pro lie Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn 

565 570 " 575 

Arg Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 

580 585 590 

Lys Glu Ala Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin 

595 600 6 05 

Leu Leu Gly lie Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin 

*10 615 620 

Met Leu Arg Gly Pro Gly Arg Ala Phe Val Thr lie Arg Gin Gly Ser 
625 630 635 640 

Leu Ala Ala Val Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu 

645 650 655 

Asp Lys Trp Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr 

660 665 670 

Gin Leu Lys His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala 

675 680 685 

Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Met 

690 695 voo 

Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arq Ser 
705 710 715 720 

Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys lie Glu 
725 730 735 



-49- 



WO 02/090558 



PCT/FI02/00379 



Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu Gin Asn 

740 745 750 

Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly Asn Arg 

755 760 765 

Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin 

770 775 780 

Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys 
785 790 795 800 

Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro, Met Phe Ser 

805 810 815 

Ala Leu Ser Glu, Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn 

820 825 830 

Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He 

835 840 845 

Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly 

850 855 860 

Pro He Ala Pro, Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala 
865 870 875 880 

Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn 

885. 890 895 

Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly 

900 905 910 

Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He 

915 920 925 

Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr 

930 935 940 

Lys Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn Trp Met 
945 " 950 955 960 

Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He 

965 970 975 

Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala 

980 985 990 

Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 
995 1000 1005 



<210> 48 

<211> 1006 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Hybrid protein comprised of 
CTL (RNT-optpl7/24-CTL) 



<400> 48 
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35 










40 


Gin 


Arg Gin 


He 


Arg 


Ser 


He 


Ser Glu 




50 








55 




Gly 


Arg Pro 


Ala 


Glu 


Pro 


Val 


Pro Leu 


65 








70 






Leu 


Thr Leu 


Asp 


Cys 


Ser 


Glu 


Asp Cys 








85 
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-Nef-Tat, truncated Gag protein and 
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80 
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Asp Gly Val Gly Ala Ala Ser Arg Asp Leu Glu Lys His Gly Ala lie 

It 5 c * 150 155 16 <> 

Thr Ser Ser Asn Thr Ala Thr Asn Asn Ala Ala Cys Ala Trp Leu Glu 

165 170 175 

Ala Gin Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gin Val Pro 

180 185 190 

Leu Arg Pro Met Thr Tyr Lys Gly Ala Leu Asp Leu Ser His Phe Leu 

195 200 205 

Lys Glu Lys Gly Gly Leu Glu Gly Leu lie Tyr Ser Pro Lys Arg Gin 

210 215 220 

Glu He Leu Asp Leu Trp Val Tyr His Thr Gin Gly Tyr Phe Pro Asp 
225 230 235 240 

Trp Gin Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe 

245 250 255 

Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro Asp Glu Glu Glu Asn 

260 265 270 

Ser Ser Leu Leu His Pro Ala Ser Leu His Gly Thr Glu Asp Thr Glu 

275 280 285 

Arg Glu Val Leu Lys Trp Lys Phe Asp Ser His Leu Ala Phe His His 

290 ' 295 300 

Lys Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys Lys Leu Glu 
305 310 315 320 

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gin Pro 

325 330 * 335 

Arg Thr Pro Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Leu His Cys 

340 345 350 

Gin Val Cys Phe Thr Arg Lys Gly Leu Gly He Ser Tyr Gly Arg Lys 

355 360 365 

Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr His Gin 

370 375 380 

Val Ser Leu Pro Lys Gin Pro Ser Ser Gin Gin Arg Gly Asp Pro Thr 
I* 5 390 395 " 400 

Gly Pro Lys Lys Ser Lys Lys Lys Val Glu Arg Glu Thr Glu Ala Asp 

405 410 415 

Pro Phe Asp Ala Ala Val Gly Ala Arg Ala Ser Val Leu Ser Gly Gly 

420 425 430 

Glu Leu Asp Lys Trp Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys 

435 440 445 

Lys Tyr Gin Leu Lys His He Val Trp Ala Ser Arg Glu Leu Glu Arg 

450 455 460 

Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin 
\ 65 470 475 * " 480 

He Met Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu 

485 490 495 

Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Lys 

500 505 510 

He Glu Val Lys Asp Thr Lys Glu Ala Leu Asp Lys Val Glu Glu Glu 

• 515 520 525 

Gin Asn Asn Ser Lys Lys Lys Ala Gin Gin Glu Ala Ala Asp Ala Gly 

530 535 540 

Asn Arg Asn Gin Val Ser Gin Asn Tyr Pro He Val Gin Asn Leu Gin 
5 ? 5 , 550 555 560 

Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp 

565 570 575 

Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro Met 

580 585 590 

Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met 

595 600 605 

Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu 

610 615 6 20 

Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu His Pro Val His 
6 J S , 630 635 640 

Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp 
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645 650 655 

lie Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin lie Gly Trp Met Thr 

660 665 670 

Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp He He 

• 675 680 685 

Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu 

690 695 700 

Asp lie Lys Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg 
705 710 715 720 

Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Thr Gin Glu Val Lys Asn - 

725* 730 735 

Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys 

740 745 750 

Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met 

755 760 765 

Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 

770 775 780 

Ala Ala Val He Thr Leu Trp Gin Arg Pro Leu Val Ala Leu He Glu 
785 790 795 800 

He Cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys lie Gly Pro 

805 810 815 

Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp 

820 825 830 

Ala Tyr Phe Ser Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala 

835 840 845 ; 

Phe Thr lie Pro Ser He Trp Lys Gly Ser Pro Ala lie Phe Gin Ser 

850 855 860 

Ser Met Thr Lys Lys Gin Asn Pro Asp He Val He Tyr Gin Tyr Met 
865 870 875 880 

Asp Asp Leu Tyr Val Pro He Val Leu Pro Glu Lys Asp Ser Trp Leu 

885 890 895 

Val Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Ala Gly He Lys Val 

900 90S 910 

Lys Gin Leu lie Leu Lys Glu Pro Val His Gly Val Tyr Glu Pro lie 

915 920 925 

Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Ala Gly 

930 935 940 

Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
945 * 950 955 960 

Thr Thr Thr Leu Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly 

965 " ~ 970 975 

He Trp Gly Cys Ala Cys Thr Pro Tyr Asp He Asn Gin Met Leu Arg 

980 " 985 990 

Gly Pro Gly Arg Ala Phe Val Thr He Arg Gin Gly Ser Leu 
995 1000 1005 



<210> 49 
<211> 7945 
<212> DNA 

<213> Bovine papillomavirus 



<220> 

<221> misc_feature 

<222> 1205~ 

<223> n = A,T,C or G 



<400> 49 

gttaacaata atcacaccat caccgttttt 
aaaaagctgc tgacagaccc cggttttcac 
ccattctcag ggttggattg tctgtggtgc 
aggtgcatgg tcaaagactt tcatgttgta 
accatttgtc ttgaaaactg tttagctact 
acaggtgagg aagctgaatt attgcatggc 



tcaagcggga aaaaatagcc agctaactat 60 
atggacctga aaccttttgc aagaaccaat 120 
agagagcctc ttacagaagt tgatgctttt 180 
attcgggaag gctgtagata tggtgcatgt 240 
gaaagaagac tttggcaagg tgttccagta 300 
aaaacacttg ataggctttg cataagatgc 360 
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tgctactgtg ggggcaaact aacaaaaaat gaaaaacatc ggcatgtgct ttttaatgag 420 
cctttc.tgca aaaccagagc taacataatt agaggacgct gctacgactg ctgcagacat 480 
ggttcaaggt ccaaataccc atagaaactt ggatgattca cctgcaggac cgttgctgat 540 
tttaagtcca tgtgcaggca cacctaccag gtctcctgca gcacctgatg cacctgattt 600 
cagacttccg tgccatttcg gccgtcctac taggaagcga ggtcccacta cccctccgct 660 
ttcctctccc ggaaaactgt gtgcaacagg gccacgtcga gtgtattctg tgactgtctg 720 
ctgtggaaac tgcggaaaag agctgacttt tgctgtgaag accagctcga cgtccctgct 780 
tggatttgaa caccctttaa actcagattt agacctcttg tgtccacgtt gtgaatctcg 840 
cgagcgtcat ggcaaacgat aaaggtagca attgggattc gggcttggga tgctcatatc 900 
tgctgactga ggcagaatgt gaaagtgaca aagagaatga ggaacccggg gcaggtgtag 960 
aactgtctgt ggaatctgat cggtatgata gccaggatga ggattttgtt gacaatgcat 1020 
cagtctttca gggaaatcac ctggaggtct tccaggcatt agagaaaaag gcgggtgagg 1080 
agcagatttt aaatttgaaa agaaaagtat tggggagttc gcaaaacagc agcggttccg 1140 
aagcatctga aactccagtt aaaagacgga aatcaggagc aaagcgaaga ttatttgctg 1200 
aaaangaagc taaccgtgtt cttacgcccc tccaggtaca gggggagggg gaggggaggc 1260 
aagaacttaa tgaggagcag gcaattagtc atctacatct gcagcttgtt aaatctaaaa 1320 
atgctacagt ttttaagctg gggctcttta aatctttgtt cctttgtagc ttccatgata 1380 
ttacgaggtt gtttaagaat gataagacca ctaatcagca atgggtgctg gctgtgtttg 1440 
gccttgcaga ggtgtttttt gaggcgagtt tcgaactcct aaagaagcag tgtagttttc 1500 
tgcagatgca aaaaagatct catgaaggag gaacttgtgc agtttactta atctgcttta 1560 
■ acacagctaa aagcagagaa acagtccgga atctgatggc aaacacgcta aatgtaagag 1620 
aagagtgttt gatgctgcag ccagctaaaa ttcgaggact cagcgcagct ctattctggt 1680 
ttaaaagtag tttgtcaccc gctacactta aacatggtgc tttacctgag tggatacggg 1740 
cgcaaactac tctgaacgag agcttgcaga ccgagaaatt cgacttcgga actatggtgc 1800 
aatgggccta tgatcacaaa tatgctgagg agtctaaaat agcctatgaa tatgctttgg 1860 
ctgcaggatc tgatagcaat gcacgggctt ttttagcaac taacagccaa gctaagcatg 1920 
tgaaggactg tgcaactatg gtaagacact atctaagagc tgaaacacaa gcattaagca 1980 
tgcctgcata tattaaagct aggtgcaagc tggcaactgg ggaaggaagc tggaagtcta 2040 
tcctaacttt ttttaactat cagaatattg aattaattac ctttattaat gctttaaagc 2100 
tctggctaaa aggaattcca aaaaaaaact gtttagcatt tattggccct ccaaacacag 2160 
gcaagtctat gctctgcaac tcattaattc attttttggg tggtagtgtt ttatcttttg 2220 
ccaaccataa aagtcacttt tggcttgctt ccctagcaga tactagagct gctttagtag 2280 
atgatgctac tcatgcttgc tggaggtact ttgacacata cctcagaaat gcattggatg 2340 
gctaccctgt cagtattgat agaaaacaca aagcagcggt tcaaattaaa gctccacccc 24 00 
tcctggtaac cagtaatatt gatgtgcagg cagaggacag atatttgtac ttgcatagtc 2460 
gggfcgcaaac ctttcgcttt gagcagccat gcacagatga atcgggtgag caacctttta 2520 
atattactga tgcagattgg aaatcttttt ttgtaaggtt atgggggcgt ttagacctga 2580 
ttgacgagga ggaggatagt gaagaggatg gagacagcat gcgaacgttt acatgtagcg 2640 
caagaaacac aaatgcagtt gattgagaaa agtagtgata agttgcaaga tcatatactg 2700 
tactggactg ctgttagaac tgagaacaca ctgctttatg ctgcaaggaa aaaaggggtg 2760 
actgtcctag gacactgcag agtaccacac tctgtagttt gtcaagagag agccaagcag 2820 
gccattgaaa tgcagttgtc tttgcaggag ttaagcaaaa ctgagtttgg ggatgaacca 2880 
tggtctttgc ttgacacaag ctgggaccga tatatgtcag aacctaaacg gtgctttaag 2940 
aaaggcgcca gggtggtaga ggtggagttt gatggaaatg caagcaatac aaactggtac 3000 
actgtctaca gcaatttgta catgcgcaca gaggacggct ggcagcttgc gaaggctggg 3060 
gctgacggaa ctgggctcta ctactgcacc atggccggtg ctggacgcat ttactattct 3120 
cgctttggtg acgaggcagc cagatttagt acaacagggc attactctgt aagagatcag 3180 
gacagagtgt atgctggtgt ctcatccacc tcttctgatt ttagagatcg cccagacgga 3240 
gtctgggtcg catccgaagg acctgaagga gaccctgcag gaaaagaagc cgagccagcc 33 00 
cagcctgtct cttctttgct cggctccccc gcctgcggtc ccatcagagc aggcctcggt 3360 
tgggtacggg acggtcctcg ctcgcacccc tacaattttc ctgcaggctc ggggggctct 3420 
attctccgct cttcctccac cccgtgcagg gcacggtacc ggtggacttg gcatcaaggc 3480 
aggaagaaga ggagcagtcg cccgactcca cagaggaaga accagtgact ctcccaaggc 3540 
gcaccaccaa tgatggattc cacctgttaa aggcaggagg gtcatgcttt gctctaattt 3600 
caggaactgc taaccaggta aagtgctatc gctttcgggt gaaaaagaac catagacatc 3660 
gctacgagaa ctgcaccacc acctggttca cagttgctga caacggtgct gaaagacaag 372 0 
gacaagcaca aatactgatc acctttggat cgccaagtca aaggcaagac tttctgaaac 3780 
atgtaccact acctcctgga atgaacattt ccggctttac agccagcttg gacttctgat 3840 
cactgccatt gccttttctt catctgactg gtgtactatg ccaaatctat ggtttctatt 3900 
gttcttggga ctagttgctg caatgcaact gctgctatta ctgttcttac tcttgttttt 3 960 
tcttgtatac tgggatcatt ttgagtgctc ctgtacaggt ctgccctttt aatgccttta 4020 
catcactggc tattggctgt .gtttttactg ttgtgtggat ttgatttgtt ttatatactg 4080 
tatgaagttt tttcatttgt gcttgtattg ctgtttgtaa gttttttact agagtttgta 4140 
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ttccccctgc tcagatttta tatggtttaa gctgcagcaa taaaaatgag tgcacgaaaa 4200 
agagtaaaac gtgccagtgc ctatgacctg tacaggacat gcaagcaagc gggcacatgt 4260 
ccaccagatg tgataccaaa ggtagaagga gatactatag cagataaaat tttgaaattt 4320 
gggggtcttg caatctactt aggagggcta ggaataggaa catggtctac tggaagggtt 4380 
gctgcaggtg gatcaccaag gtacacacca ctccgaacag cagggtccac atcatcgctt 4440 
gcatcaatag gatccagagc tgtaacagca gggacccgcc ccagtatagg tgcgggcatt 4500 
cctttagaca cccttgaaac tcttggggcc ttgcgtccag gggtgtatga ggacactgtg 4560 
ctaccagagg cccctgcaat agtcactcct gatgctgttc ctgcagattc agggcttgat 4620 
gccctgtcca taggtacaga ctcgtccacg gagaccctca ttactctgct agagcctgag 4680 
ggtcccgagg acatagcggt tcttgagctg caacccctgg accgtccaac ttggcaagta 4740 
agcaatgctg ttcatcagtc ctctgcatac cacgcccctc tgcagctgca atcgtccatt 4800 
gcagaaacat ctggtttaga aaatattttt gtaggaggct cgggtttagg ggatacagga 4860 
ggagaaaaca ttgaactgac atacttcggg tccccacgaa caagcacgcc ccgcagtatt 4920 
gcctctaaat cacgtggcat tttaaactgg ttcagtaaac ggtactacac acaggtgccc 4980 
acggaagatc ctgaagtgtt ttcatcccaa acatttgcaa acccactgta tgaagcagaa 5040 
ccagctgtgc ttaagggacc tagtggacgt gttggactca gtcaggttta taaacctgat 5100 
acacttacaa cacgtagcgg gacagaggtg ggaccacagc tacatgtcag gtactcattg 5160 
agtactatac atgaagatgt agaagcaatc ccctacacag ttgatgaaaa tacacaggga 5220 
cttgcattcg tacccttgca tgaagagcaa gcaggttttg aggagataga attagatgat 5280 
tttagtgaga cacatagact gctacctcag aacacctctt ctacacctgt tggtagtggt 5340 
gtacgaagaa gcctcattcc aactcaggaa tttagtgcaa cacggcctac aggtgttgta 5400 
acctatggct cacctgacac ttactctgct agcccagtta ctgaccctga ttctacctct 5460 
cctagtctag ttatcgatga cactactact acaccaatca ttataattga tgggcacaca 5520 
gttgatttgt acagcagtaa ctacaccttg catccctcct tgttgaggaa acgaaaaaaa 5580 
cggaaacatg cctaattttt tttgcagatg gcgttgtggc aacaaggcca gaagctgtat 5640 
ctccctccaa cccctgtaag caaggtgctt tgcagtgaaa cctatgtgca aagaaaaagc 5700 
attttttatc atgcagaaac ggagcgcctg ctaactatag gacatccata ttacccagtg 5760 
tctatcgggg ccaaaactgt tcctaaggtc tctgcaaatc agtatagggt atttaaaata 5820 
caactacctg atcccaatca atttgcacta cctgacagga ctgttcacaa cccaagtaaa 5880 
gagcggctgg tgtgggcagt cataggtgtg caggtgtcca gagggcagcc tcttggaggt 5940 
actgtaactg ggcaccccac ttttaatgct ttgcttgatg cagaaaatgt gaatagaaaa 6000 
gtcaccaccc aaacaacaga tgacaggaaa caaacaggcc tagatgctaa gcaacaacag 6060 
attctgttgc taggctgtac ccctgctgaa ggggaatatt ggacaacagc ccgtccatgt 6120 
gttactgatc gtctagaaaa tggcgcctgc cctcctcttg aattaaaaaa caagcacata 6180 
gaagatgggg atatgatgga aattgggttt ggtgcagcca acttcaaaga aattaatgca 6240 
agtaaatcag atctacctct tgacattcaa aatgagatct gcttgtaccc agactacctc 6300 
aaaatggctg aggacgctgc tggtaatagc atgttctttt ttgcaaggaa agaacaggtg 6360 
tatgttagac acatctggac cagagggggc tcggagaaag aagcccctac cacagatttt 6420 
tatttaaaga ataataaagg ggatgccacc cttaaaatac ccagtgtgca ttttggtagt 6480 
cccagtggct cactagtctc aactgataat caaattttta atcggcccta ctggctattc 6540 
cgtgcccagg gcatgaacaa tggaattgca tggaataatt tattgttttt aacagtgggg 6600 
gacaatacac gtggtactaa tcttaccata agtgtagcct cagatggaac cccactaaca 6660 
gagtatgata gctcaaaatt caatgtatac catagacata tggaagaata taagctagcc 6720 
tttatattag agctatgctc tgtggaaatc acagctcaaa ctgtgtcaca tctgcaagga 6780 
cttatgccct ctgtgcttga aaattgggaa ataggtgtgc agcctcctac ctcatcgata 6840 
ttagaggaca cctatcgcta tatagagtct cctgcaacta aatgtgcaag caatgtaatt 6900 
cctgcaaaag aagaccctta tgcagggttt aagttttgga acatagatct taaagaaaag 6960 
ctttctttgg acttagatca atttcccttg ggaagaagat ttttagcaca gcaaggggca 7020 
ggatgttcaa ctgtgagaaa acgaagaatt agccaaaaaa cttccagtaa gcctgcaaaa 7080 
aaaaaaaaaa aataaaagct aagtttctat aaatgttctg taaatgtaaa acagaaggta 7140 
agtcaactgc acctaataaa aatcacttaa tagcaatgtg ctgtgtcagt tgtttattgg 7200 
aaccacaccc ggtacacatc ctgtccagca tttgcagtgc gtgcattgaa ttattgtgct 7260 
ggctagactt catggcgcct ggcaccgaat cctgccttct cagcgaaaat gaataattgc 7320 
tttgttggca agaaactaag catcaatggg acgcgtgcaa agcaccggcg gcggtagatg 7380 
cggggtaagt actgaatttt aattcgacct atcccggtaa agcgaaagcg acacgctttt 7440 
ttttcacaca tagcgggacc gaacacgtta taagtatcga ttaggtctat ttttgtctct 7500 
ctgtcggaac cagaactggt aaaagtttcc attgcgtctg ggcttgtcta tcattgcgtc 7560 
tctatggttt ttggaggatt agacggggcc accagtaatg gtgcatagcg gatgtctgta 7620 
ccgccatcgg tgcaccgata taggtttggg gctccccaag ggactgctgg gatgacagct 7680 
tcatattata ttgaatgggc gcataatcag cttaattggt gaggacaagc tacaagttgt 7740 
aacctgatct ccacaaagta cgttgccggt cggggtcaaa ccgtcttcgg tgctcgaaac 7800 
cgccttaaac tacagacagg tcccagccaa gtaggcggat caaaacctca aaaaggcggg 7860 
agccaatcaa aatgcagcat tatattttaa gctcaccgaa accggtaagt aaagactatg 7920 
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tattttttcc cagtgaataa ttgtt 

<210> 50 
<211> 306 
<212> PRT 

<213> bovine papillomavirus type l 
<400> 50 

Met Glu Thr Ala Cys Glu Arg Leu His Val Ala Gin Glu Thr Gin Met 

10 15 
Gin Leu lie Glu Lys Ser Ser Asp Lys Leu Gin Asp His lie Leu Tyr 

20 25 ' 20 

Trp Thr Ala Val Arg Thr Glu Asn Thr Leu Leu Tyr Ala Ala Arg Lys 

35 40 45 

Lys Gly Val Thr Val Leu Gly His Cys Arg Val Pro His Ser Val Val 

50 55 go 

Cys Gin Glu Arg Ala Lys Gin Ala lie Glu Met Gin Leu Ser Leu Gin 

Glu Leu Ser Lys Thr Glu Phe Gly Asp Glu Pro Trp Ser Leu Leu Asp 
_ 85 90 * 95 

Thr ser Trp Asp Arg Tyr Met Ser Glu Pro Lys Arg Cys Phe Lys Lys 
100 in? 1 



iuu 105 " no 

Gly Ala Arg Val Val Glu Val Glu Phe Asp Gly Asn Ala Ser Asn Thr 

115 120 125 

Asn Trp Tyr Thr Val Tyr Ser Asn Leu Tyr Met Arg Thr Glu Asp Gly 

Trp Gin Leu Ala Lys Ala Gly Ala Asp Gly Thr o" Leu Tyr Tyr Cys 



iDU 155 i cc\ 

Thr Met Ala Gly Ala Gly Arg lie Tyr Tyr Ser Arg Phe Gly Asp III 
165 170 --- 



Ala Ala Arg Phe Ser Thr Thr Gly His Tyr Ser Val Arg Asp Gin Asp 

0 185 lOQ 

Arg Val Tyr Ala Gly Val Ser Ser Thr Ser Ser Asp Phe Arg Asp Arg 

195 200 205 

Pro Asp Gly Val Trp Val Ala Ser Glu Gly Pro Glu Gly Asp Pro Ala 
^, 1U 215 220 

Gly Lys Glu Ala Glu Pro Ala Gin Pro Val Ser Ser Leu Leu Gly Ser 
230 235 2 4 A 

Pro Ala Cys Gly Pro lie Arg Ala Gly Leu Gly Trp Val Arg Asp Gly 

245 250 255 

Pro Arg Ser His Pro Tyr Asn Phe Pro Ala Gly Ser Gly Gly Ser He 

260 265 270 

Leu Arg Ser Ser Ser Thr Pro Cys Arg Ala Arg Tyr Arg Trp Thr Trp 

280 285 
His Gin Gly Arg Lys Lys Arg Ser Ser Arg Pro Thr Pro Gin Arg Lys 

295 mn 



Asn Gin 
305 



7945 



<210> 51 
<211> 622 
<212> DNA 

<213> Human herpesvirus 4 
<400> 51 

gggtatcata tgctgactgt atatgcatga 
ggatagcata tactacccag atatagatta 
ggatagccta tgctacccag atataaatta 
ggatagcata tgctacccag atatagatta 
ggatagcata tgctacccag atatagatta 
agtatatgct acccagatat aaattaggat 
agcatatgct acccggatac agattaggat 
agcatatgct acccagatat agattaggat 



ggatagcata tgctacccgg atacagatta 60 
ggatagcata tgctacccag atatagatta 120 
ggatagcata tactacccag atatagatta 180 
ggatagccta tgctacccag atatagatta 240 
ggatagcata tgctatccag atatttgggt 300 
agcatatact accctaatct ctattaggat 360 
agcatatact acccagatat agattaggat 420 
agcctatgct acccagatat aaattaggat 480 
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agcatatact acccagatat agattaggat agcatatgct acccagatat agattaggat 540 
agcctatgct acccagatat agattaggat agcatatgct atccagatat ttgggtagta 600 
tatgctaccc atggcaacat ta 622 

<210> 52 
<211> 641 
<212> PRT 

<213> Human herpesvirus 4 



<400> 52 



Met 


Ser 


Asp 


Glu 


Gly 


Pro 


Gly 


Thr 


Gly 


Pro 


Gly 


Asn 


Gly 


Leu 


Gly 


Glu 


1 








5 










10 










15 




Lys 


Gly 


Asp 


Thr 


Ser 


Gly 


Pro 


Glu 


Gly 


Ser 


Gly 


Gly 


Ser 


Gly 


Pro 


Gin 








20 










25 










30 






Arg 


Arg 


Gly 


Gly 


Asp 


Asn 


His 


Gly 


Arg 


Gly 


Arg 


Gly 


Arg 


Gly 


Arg 


Gly 






35 










40 










45 








Arg 


Gly 


Gly 


Gly 


Arg 


Pro 


Gly 


Ala 


Pro 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Pro 




50 










55 










60 










Arg 


His 


Arg 


Asp 


Gly 


Val 


Arg 


Arg 


Pro 


Gin 


Lys 


Arg 


Pro 


Ser 


Cys 


lie 


65 










70 










75 










80 


Gly 


Cys 


Lys 


Gly 


Thr 


His 


Gly 


Gly 


Thr 


Gly 


Ala 


Gly 


Ala 


Gly 


Ala 


Gly 










85 










90 










95 




Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 








100 










105 










110 






Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 






115 










120 










125 








Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 




130 










135 










140 










Gly 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


1 4.R 




















ICC 










160 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 










165 










170 










175 




Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Gly 








180 










185 










190 






Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 






195 










200 










205 








Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 




210 










215 










220 










Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


225 










230 










235 










240 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 










245 










250 










255 




Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 








260 










265 










270 






Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 






275 










280 










285 








Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 




290 










295 










300 










Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Ala 


Gly 


Ala 


Gly 


305 










310 










315 










320 


Gly 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Arg 


Gly 


Arg 


Gly 


Gly 


Ser 


Gly 


Gly 










325 










330 










335 




Arg 


Gly 


Arg 


Gly 


Gly 


Ser 


Gly 


Gly 


Arg 


Gly 


Arg 


Gly 


Gly 


Ser 


Gly 


Gly 








340 










345 










350 






Arg 


Arg 


Gly 


Arg 


Gly 


Arg 


Glu 


Arg 


Ala 


Arg 


Gly 


Gly 


Ser 


Arg 


Glu 


Arg 






355 










360 










365 








Ala 


Arg 


Gly 


Arg 


Gly 


Arg 


Gly 


Arg 


Gly 


Glu 


Lys 


Arg 


Pro 


Arg 


Ser 


Pro 




370 










375 










380 










Ser 


Ser 


Gin 


Ser 


Ser 


Ser 


Ser 


Gly 


Ser 


Pro 


Pro 


Arg 


Arg 


Pro 


Pro 


Pro 


385 










390 










395 








400 


Gly 


Arg 


Arg 


Pro 


Phe 


Phe 


His 


Pro 


Val 


Gly 


Glu 


Ala 


Asp 


Tyr 


Phe 


Glu 










405 










410 








415 




Tyr 


His 


Gin 


Glu 


Gly 


Gly 


Pro 


Asp 


Gly 


Glu 


Pro 


Asp 


Val 


Pro 


Pro 


Gly 
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Ala He 

Gly Pro 
450 
Phe Gly 
465 

lie Ala 

Thr Thr 

Ser Lys 

Pro Gin 
530 
Pro Gly 
545 

Tyr Phe 

Asp Ala 

lie Arg 

Trp Phe 
610 
Asp Asp 
625 
Glu 



420 
Glu Gin 
435 

Arg Gly 

Lye His 

Glu Gly 

Asp Glu 
500 
Thr Ser 
515 

Cys Arg 

Pro Gly 

Met Val 

lie Lys 
580 
Val Thr 
595 

Pro Pro 
Gly Asp 



Gly Pro 

Gin Gly 

Arg Gly 
470 
Leu Arg 
485 

Gly Thr 

Leu Tyr 

Leu Thr 

Pro Gin 
550 
Phe Leu 
565 

Asp Leu 

Val Cys 

Met Val 

Glu Gly 
630 



Ala Asp 
440 
Asp Gly 
455 

Gin Gly 

Ala Leu 

Trp Val 

Asn Leu 
• 520 
Pro Leu 
535 

Pro Gly 

Gin Thr 

Val Met 

Ser Phe 
600 
Glu Gly 
615 

Gly Asp 



425 

Asp Pro 

Gly Arg 

Gly Ser 

Leu Ala 
490 
Ala Gly 
505 

Arg Arg 

Ser Arg 

Pro Leu 

His He 
570 
Thr Lys 
585 

Asp Asp 
Ala Ala 
Gly Asp 



Gly Glu 

Arg Lys 
460 
Asn Pro 
475 

Arg Ser 

Val Phe 

Gly Thr 

Leu Pro 
540 
Arg Glu 
555 

Phe Ala 

Pro Ala 

Gly Val 

Ala Glu 
620 
Glu Gly 
635 



Gly Pro 
445 

Lys Gly 

Lys Phe 

His Val 

Val Tyr 
510 
Ala Leu 
525 

Phe Gly 

Ser He 

Glu Val 

Pro Thr 
590 
Asp Leu 
605 

Gly Asp 
Glu Glu 



Ser Thr 

Gly Trp 

Glu Asn 
480 
Glu Arg 
495 

Gly Gly 

Ala He 

Met Ala 

Val Cys 
560 
Leu Lys 
575 

Cys Asn 

Pro Pro 

Asp Gly 

Gly Gin 
640 
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